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Part one 


Translator’s introduction 


Translator’s introduction 


0.1 General remarks 


Dedekind’s invention of ideals in the 1870s was a major turning point in 
the development of algebra. His aim was to apply ideals to number the- 
ory, but to do this he had to build the whole framework of commutative 
algebra: fields, rings, modules and vector spaces. These concepts, to- 
gether with groups, were to form the core of the future abstract algebra. 
At the same time, he created algebraic number theory, which became 
the temporary home of algebra while its core concepts were growing 
up. Algebra finally became independent in the 1920s, when fields, rings 
and modules were generalised beyond the realm of numbers by Emmy 
Noether and Emil Artin. But even then, Emmy Noether used to say 
“Es steht schon bei Dedekind” (“It’s already in Dedekind”), and urged 
her students to read all of Dedekind’s works in ideal theory. 

Today this is still worthwhile, but not so easy. Dedekind wrote for 
an audience that knew number theory - especially quadratic forms — 
but not the concepts of ring, field or module. Today’s readers probably 
have the opposite qualifications, and of course most are not fluent in 
German and French. In an attempt to overcome these problems, I have 
translated the most accessible of Dedekind’s works on ideal theory, Sur 
la Théorie des Nombres Entiers Algébriques, Dedekind (1877), which he 
wrote to explain his ideas to a general mathematical audience. This 
memoir shows the need for ideals in a very concrete case, the numbers 
m+n/—5 where m,n € Z, before going on to develop a general theory 
and to prove the theorem on unique factorisation into prime ideals. 

The algebraic integers in Dedekind’s title are a generalisation of the 
ordinary integers — created in response to certain limitations of classical 
number theory. The ordinary integers have been studied since ancient 
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times, and their basic theory was laid down in Euclid’s Elements (see 
Heath (1925)) around 300 BC. Yet even ancient number theory contains 
problems not solvable by Euclid’s methods. Sometimes it is necessary 
to use irrational numbers, such as 2, to answer questions about the 
ordinary integers. A famous example is the so-called Pell equation 


z* —cy? =1 

where c is a nonsquare integer and the solutions x, y are required to be 
integers. Solutions for certain values of c were known to the ancients, but 
the complete solution was not obtained until Lagrange (1768) related the 
equation to the continued fraction expansion of ,/c. He also showed that 
each solution is obtained from a certain “minimum” solution (29, yo) by 
the formula 


re t+ yee = (xo + yove)*. 


The irrational numbers x, + y,./¢ in this formula are examples of alge- 
braic integers, which are defined in general to be roots of equations of 
the form 


a” + an—190" 1} +---+a;a+ a9 =0 


where @,-1,-.-,@9 are ordinary integers, that is, an—1,...,@9 € Z. 

Algebraic integers are so called because they share some properties 
with the ordinary integers. In particular, they are closed under sum, 
difference and product, and the rational algebraic integers are just the 
ordinary integers (for more details, see 0.6.2 and §13 of Dedekind’s mem- 
oir). Because of the second fact, the ordinary integers are also known 
as rational integers. The first fact implies that the algebraic integers 
form a ring. However, we are not interested in the ring of all algebraic 
integers so much as rings like 


Ziv = {c + yv2:: 2,y €Z} 


Zi] = {a+ yi: 2,y € Z}. 


In general we use the notation Z[a] to denote the closure of the set 
ZU {a} under +, — and x. The reason for working in rings Z[a] is 
that they more closely resemble Z, and hence are more likely to yield 
information about Z.: 

Any ring R of algebraic integers includes Z, so theorems about Z 
may be obtainable as special cases of theorems about R (we shall see 


0.1 General remarks 5 


several examples later). However, useful theorems about R are provable 
only when RF has all the basic properties of Z, in particular, unique 
prime factorisation. This is not always the case. Z[,/—5] is the simplest 
example where unique prime factorisation fails, and this is why Dedekind 
studies it in detail. His aim is to recapture unique prime factorisation by 
extending the concept of integer still further, to certain sets of algebraic 
integers he calls ideals. This works only if the size of R is limited in some 
way. The ring A of all algebraic integers is “too big” because it includes 
a along with each algebraic integer a. This gives the factorisation 
a = Va,/a and hence “primes” do not exist in A, let alone unique 
prime factorisation. 

Dedekind found the appropriate “small” rings R in algebraic number 
fields of finite degree, each of which has the form Q(a), where a is an 
algebraic integer. Q(a) denotes the closure of Q U {a} under +,—,x 
and + (by a nonzero number), and each Q(a) has its own integers, 
which factorise into primes. In particular, Z[,/—5] is the ring of integers 
of Q(./—5), and 6 = 2 x 3 is a prime factorisation of 6. Not the prime 
factorisation, alas, because 6 = (1++/—5)(1—/—5) is also a factorisation 
into primes (see 0.4.5). However, unique prime factorisation is regained 
when one passes to the ideals of Z[,/—5], and Dedekind generalises this 
to any Q(a). The result is at last a theory of algebraic integers capable 
of yielding information about ordinary integers. 

A lot of machinery is needed to build this theory, but Dedekind ex- 
plains it well. Suffice to say that fields, rings and modules arise very 
naturally as sets of numbers closed under the basic operations of arith- 
metic. Fields are closed under +, —, x and +, rings are closed under +, 
— and x, while modules are closed under + and —. The term “ring” was 
actually introduced by Hilbert (1897); Dedekind calls them “domains” 
here, and I have thought it appropriate to retain this terminology, since 
these particular rings are prototypes of what are now called Dedekind 
domains. Dedekind presumably chose the name “module” because a 
module M is something for which “congruence modulo M” is meaning- 
ful. His name for field, Kérper (which also means “body” in German), 
was chosen to describe “a system with a certain completeness, fullness 
and self-containedness; a naturally unified, organic whole”, as he ex- 
plained in his final exposition of ideal theory, Dedekind (1894), §160. 

What Dedekind does not explain is where Z[,/—5] comes from, and 
why it is important in number theory. This is understandable, because 
his first version of ideal theory was a supplement to Dirichlet’s number 
theory lectures, Vorlesungen tiber Zahlentheorie (Dirichlet (1871)). In 
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the present memoir he also refers to the Vorlesungen frequently, so his 
original audience was assumed to have a good background in number 
theory, and particularly the theory of quadratic forms. Such a back- 
ground is less common today, but is easy and fun to acquire. Even 
experts may be surprised to learn how far back the story goes. The 
specific role of /—5 can be traced back to the anomalous behaviour of 
the quadratic form x? + 5y”, first noticed by Fermat, and later explained 
in different ways by Lagrange, Gauss and Kummer. But the reason for 
Fermat’s interest in x? + 5y? goes back much further, perhaps to the 
prehistory of mathematics in ancient Babylon. Let us begin there. 


0.2 Squares 
0.2.1 Pythagorean triples 


Integers a, b, c such that 
+P=c? 


are one of the oldest treasures of mathematics. Such numbers occur as 
the sides of right-angled triangles, and they may even have been used to 
construct right angles in ancient times. They are called Pythagorean 
triples after Pythagoras, but they were actually discovered indepen- 
dently in several different cultures. The Babylonians were fascinated 
by them as early as 1800 BC, when they recorded fifteen of them on a 
tablet now known as Plimpton 322 (see Neugebauer and Sachs (1945)). 
Pythagorean triples other than the simplest ones (3,4,5), (5,12,13) or 
(8,15,17) are not easily found by trial and error, so the Babylonians 
probably knew a general formula such as 
a=2u, b=w-v*?, c=u?+v? 
which yields an unlimited supply of Pythagorean triples by substituting 
different integers for u, v. 
The general solution of a? + b? = c? is in fact 


a=2uvw, b=(u?—v?)w, c= (u? +"), 


as may be found in Euclid’s Elements Book X (lemma after Proposition 
29). A key statement in Euclid’s proof is: if the product of relatively 
prime integers is a square, then the integers themselves are squares. 
Euclid first used the general formula for Pythagorean triples in his the- 
ory of irrational numbers, and it is in a different book from his theory 
of integers. The assumption that relatively prime integers are squares 
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when their product is a square is justified by a long chain of proposi- 
tions, stretching over several books of the Elements. However, a direct 
justification is possible from his theory of integer divisibility, which is in 
Book VII. This theory is fundamental to the theory of ordinary integers, 
and also the inspiration for Dedekind’s theory of ideals, so we should re- 
call its main features before going any further. Among other things, it 
identifies the important but elusive role of primes. 


0.2.2 Divisors and prime factorisation 


An integer m divides an integer n if n = ml for some integer |. We also 
say that m is a divisor of n, or that n is a multiple of m. An integer 
p whose only divisors are +1 and +p is called a prime, and any integer 
can be factorised into a finite number of primes by successively finding 
divisors unequal to +1 but of minimal absolute value. However, it is not 
obvious that each factorisation of an integer n involves the same set of 
primes. There is conceivably a factorisation of some integer 


n= Pip2-**Pi = 9192°°° GG 


into primes p1,p2,...,p; and qi, Q2,--.-,q; respectively, where one of the 
primes p is different from all the primes gq. 

Nonunique prime factorisation is ruled out by the following proposi- 
tion of Euclid (Elements, Book VII, Proposition 30). 


Prime divisor property. If p is prime and p divides the product ab of 
integers a, b, then p divides a or p divides b. 


An interesting aspect of the proof is its reliance on the concept of 
greatest common divisor (gcd), particularly the fact that 


gcd(a,b) = ua+vb for some integers u, v. 


The set {ua+vb: u,v € Z} is in fact an ideal, and unique prime factori- 
sation is equivalent to the fact that this ideal consists of the multiples 
of one of its members, namely gcd(a, b). 

It should be mentioned that Euclid proves only the prime divisor prop- 
erty, not unique prime factorisation. In fact its first explicit statement 
and proof are in Gauss (1801), the Disquisitiones Arithmeticae, article 
16. As we shall see, this is possibly because Gauss was first to recognise 
generalisations of the integers for which unique prime factorisation is 
not valid. 
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0.2.3 Irrational numbers 


As everybody knows, Pythagorean triples also have significance as the 
sides of right-angled triangles. In any right-angled triangle, the side 
lengths a, b, c satisfy 

a? +b? =? 


whether or not a, b and c are integers (Pythagoras’ theorem). Hence it 
is tempting to try to interpret a right-angled triangle as a Pythagorean 
triple by choosing the unit of length so that a, b and c all become integer 
lengths. Pythagoras or one of his followers made the historic discovery 
that this is not always possible. The simplest counterexample is the 
triangle with sides 1, 1, /2. It is impossible to interpret this triangle as 
a Pythagorean triple because 2 is not a rational number. 

A proof of this fact, which also proves the irrationality of /3, V5, 
V6 and so on, uses unique prime factorisation to see that each prime 
appears to an even power in a square. Then the equation 


2n? = m? 
is impossible because the prime 2 occurs an odd number of times in the 
prime factorisation of 2n?, and an even number of times in the prime 
factorisation of m?. 

The irrationality of /2 led the Greeks to study the so-called Pell 
equation 


x? —Qy? = 1, 


They found it could used to approach V2 rationally, via increasingly 
large integer solutions, tn, yn. Since z2 — 2y? = 1, the quotient rp/yn 
necessarily tends to 2. The general Pell equation 


x* —cy2=1, where c is a nonsquare integer, 


can similarly be used to approach the irrational number ,/c. This equa- 
tion later proved fruitful in many other ways; Dedekind even used it to 
prove the irrationality of ,/c (Dedekind (1872), Section IV). 


0.2.4 Diophantus 


The equations a?+b? = c? and x*—2y? = 1 are examples of what we now 
call Diophantine equations, after Diophantus of Alexandria. Diophantus 
lived sometime between 150 AD and 350 AD and wrote a collection of 
books on number theory known as the Arithmetica (Heath (1910)). They 
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consist entirely of equations and ingenious particular solutions. The 
term “Diophantine” refers to the type of solution sought: either rational 
or integer. For Diophantus it is usually a rational solution, but for some 
equations, such as the Pell equation, the integer solutions are of more 
interest. The Pell equation was actually not studied by Diophantus, but 
he mentioned an integer solution to another remarkable equation: the 
solution zx = 5, y = 3 of y3 = 27 +2. (See 0.4.1 for the astonishing 
sequel to this solution.) 

Although all Diophantus’ solutions are special cases, they usually seem 
chosen to illustrate general methods. Euler (1756) went so far as to say 


Nevertheless, the actual methods that he uses for solving any of his problems 
are as general as those in use today ... there is hardly any method yet invented 
in this kind of analysis not already traceable to Diophantus. (Euler Opera 
Omnia 1,2, p. 429-430.) 


And if anyone would know, Euler would. The first mathematician to 
understand Diophantus properly was Fermat (1601-1665), but his com- 
ments were as cryptic as the Arithmetica itself. Euler spent about 40 
years, off and on, reading between the lines of Fermat and Diophantus, 
until he could reconstruct most their methods and prove their theorems. 
We shall study the connection between Diophantus, Fermat and Euler 
more thoroughly later, but one example is worth mentioning here. It 
shows how much theory can be latent in a single numerical fact. 
In the Arithmetica, Book II, Problem 19, Diophantus remarks 


65 is naturally divided into two squares in two ways, namely into 7° + 4? and 
8? + 17, which is due to the fact that 65 is the product of 13 and 5, each of 
which is the sum of two squares. 


It appears from this that Diophantus is aware of the identity 
(a? + b?)(c? +d?) = (ac + bd)” + (ad $ bc)? 


though he makes no such general statement. However, Fermat saw much 
deeper than this. Noticing, with Diophantus, that the identity reduces 
the representations of a number as a sum of two squares to the repre- 
sentations of its prime factors, his comment on the problem is: 


A prime number of the form 4n+1 is the hypotenuse of a right-angled triangle 
[that is, a sum of squares] in one way only . . . If a prime number which is the 
sum of two squares be multiplied by another prime number which is also the 
sum of two squares, the product will be the sum of two squares in two ways. 
(Heath (1910), p. 268.) 
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The restriction to primes of the form 4n + 1 is understandable because 
a prime p # 2 cannot be the sum of two squares unless it is of the form 
4n +1 (by a congruence mod 4 argument). But Fermat’s claim that any 
prime p = 4n + 1 is a sum of two squares comes right out of the blue. 
No one knows how he proved it and the first known proof is due to Euler 
(1756). As we shall see later, Lagrange, Gauss and Dedekind all used 
this theorem of Fermat to test the strength of new methods in number 
theory. 


0.3 Quadratic forms 
0.3.1 Fermat 


Unlike Euclid or Diophantus, Fermat never wrote a book. His reputa- 
tion rests on a short manuscript containing his discovery of coordinate 
geometry (independent of Descartes), his letters, and his marginal notes 
on Diophantus. He took up number theory only in his late 30s, and 
left only one reasonably complete proof, in the posthumously published 
Fermat (1670). However, it is a beautiful piece of work, and fully estab- 
lishes his credentials as both an innovator and a student of the ancients. 
It also has a place in our story, as an application of Pythagorean triples, 
and as the first proven instance of Fermat’s last theorem. Fermat’s proof 
shows that there are no positive integers x, y, z such that z4+y4 = z4, 
by showing that there are not even positive integers z, y, z such that 
z+ +y* = z?. It turns out to be the only instance of Fermat’s last the- 
orem with a really elementary proof, involving just Euclid’s theory of 
divisibility. 

The argument is by contradiction, and the gist of it is as follows (omit- 
ting mainly routine checks that certain integers are relatively prime). 

Suppose that there are positive integers x, y, z such that x*+y* = z?, 
or in other words, (x*)* + (y”)? = z? . This says that x”, y*, z is a 
Pythagorean triple, which we can take to be primitive, hence there are 
integers u, v such that 


e=Quv, yre=u?—v*, z=u? 4+’, 


by Euclid’s formulas (0.2.1). The middle equation says that v, y, u 
is also a Pythagorean triple, and it is also primitive, hence there are 
integers s, t such that 


v=2st, y=s?*-t?, u=s*4+??. 
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This gives 
x? = 2uv = 4st(s* + t’), 


so the relatively prime integers s, t and s? + t? have product equal to 
the square (1/2). It follows that each is itself a square, say 


$= 22, t=y?, v4t = 22, 


and hence 


4,,4_ 22 
Xyty = 24. 


Thus we have found another sum of two fourth powers equal to a square, 
and by retracing the argument we find that the new square z? is smaller 
than the old, z?, but still nonzero. By repeating the process we can 
therefore obtain an infinite descending sequence of positive integers, 


which is a contradiction. oO 


Fermat called the method used in this proof infinite descent, and used 
it for many of his other theorems. He claimed, for example, to have 
proved that any prime of the form 4n + 1 is a sum of two squares by 
supposing p = 4n + 1 to be a prime not the sum of two squares, and 
finding a smaller prime with the same property. However, it is very hard 
to see how to make the descent in this case. Euler (1749) found a proof 
only after several years of effort. In 0.3.4 we shall see an easier proof 
of the two squares theorem due to Lagrange. Lagrange’s proof does 
use another famous theorem of Fermat, but it is the easy one known 
as Fermat’s “little” theorem: for any prime number p, and any integer 
a #0 (mod p), we have a?-1 = 1 (mod p) (Fermat (1640b)). 

The proof of Fermat’s little theorem most likely used by Fermat uses 
induction on a and the fact that a prime p divides each of the binomial 
coefficients 

(*) = Me Dip) nitd) for 1<i<p-l, 
as is clear from the fact that p is a factor of the numerator but not of 
the denominator. This proof implicitly contains the “mod p binomial 
theorem”, 


(a +b)? = a? + BP (mod p), 
which has its uses elsewhere (see for example Gauss’s proof of quadratic 


reciprocity in 0.5.4). 
The proof more often seen today is based on that of Euler (1761), 
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which implicitly uses the group properties of multiplication mod p, par- 
ticularly the idea of multiplicative inverses. An integer a is nonzero 
mod p if ged(a,p) = 1, in which case 1 = ar + ps for some integers 
r, s by the Euclidean algorithm. The number r is called a multi- 
plicative inverse of a (mod p) since ar = 1 (mod p). It follows that 
mod p multiplication by a nonzero a is invertible, and in particular the 
set {a x la x 2,...,a x (p— 1)} is the same set (mod p) as the set 
{1,2,...,p—1}. Hence each set has the same product mod p, 


ax2ax---x(p-—l)a=1x2x---x(p—1) (mod p), 


and cancellation of 1,2,...,p— 1 from both sides (which is permissible, 
since 1,2,...,p — 1 have inverses) gives Fermat’s little theorem: 


a?~1 = 1 (mod p). 


0.3.2 The grit in the oyster 


The mathematical pearl that is Dedekind’s theory of ideals grew in re- 
sponse to a tiny irritant, the anomalous behaviour of the quadratic form 
x? +5y*. Between 1640 and 1654 Fermat discovered three beautiful the- 
orems about the representation of odd primes p by the forms x? + y’, 
x? + 2y and x? + 3y? for integer values of x and y (the first prompted 
by Diophantus’ remark on sums of squares, as mentioned in 0.2.4): 


Theorem 1. p=2x?+y? = p=1 (mod 4) (Fermat (1640c)) 
Theorem 2. p= 2x? +2y? = p=lor3(mod 8)  (Fermat (1654)) 
Theorem 3. p= 2? + 3y* = p=1 (mod 8) (Fermat (1654)) 
Fermat thought he could prove these theorems, and he was probably 
right, as proofs eventually published by Euler were based on Fermat’s 
method of infinite descent. Since x? + 4y? = x* + (2y)*, which is “of 
the form” 2? + y?, the next theorem should be about x? + 5y?. This 


is the grit in the oyster. Fermat was unable to prove a theorem about 
primes of the form x? +5y?, and could only conjecture the following less 


satisfying fact. 


If two primes which end in 3 or 7 and surpass by 3 a multiple of 4 are multiplied, 
then their product will be composed of a square and the quintuple of another 
square. (Fermat (1654).) 


Since numbers that end in 3 or 7 are of the form 10n +3 or 10n+7, and 
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such a number is also of the form 4m + 3 if and only if it is of the form 
20k + 3 or 20k + 7, Fermat’s conjecture can be restated as follows: 


Fermat’s conjecture. Jf two primes are of the form 20k+3 or 20k+7 
then their product is of the form x? + 5y?. 


This is very puzzling. What about primes of the form x? + 5y? ? 
There are some, such as 


29 = 3745 x 2?, 
41 = 6745 1?, 
61 = 47 +5 x 3?, 


and furthermore, they lie in the classes 20k + 1 and 20k + 9 that seem 
conspicuously absent from the conjecture above. This situation begs for 
an explanation. As Weil (1974) says: 

When there is something that is really puzzling and cannot be understood, 


it usually deserves the closest attention because some time or other some big 
theory will emerge from it. 


Euler (1744) found another clue to the puzzle. He noticed two “faces” to 
the behaviour of x? +5y” — sometimes it represents a prime p, sometimes 
2p — and he conjectured that the following holds for all prime values p. 
Euler’s conjecture. p= 2x" +5y? <= p=1 or 9 (mod 20) 

2p = x2 +5y? —& p=3 or 7 (mod 20). 


0.3.3 Reduction of forms 


The first to account rigorously for the two-faced behaviour of x?+5y? was 
Lagrange (1773). Studying the general question of which integers n could 
be represented by a given quadratic form ax? + bry+ cy? (where a,b,c € 
Z), he had the very fruitful idea of finding those forms a’x’*-+0/z'y’+c'y'” 
equivalent to az? + bry + cy? via a change of variables. 


The substitution 
x =ax+ By, 
y = ya + by 
maps ZZ into ZxZ provided a, 3,7, 6 € Z and it is one-to-one provided 
there is an inverse substitution 
r=a'e'+ py’, 
yay 2+ oy’. 
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[So]ly el-[o a}, 


since the product of a substitution and its inverse is the identity. There- 
fore, taking determinants of both sides, 

a B a’ p' | ay 
y 6 7! é! = ee 
Finally, since the determinants on the left are integers and the only 
integer divisors of 1 are +1, it follows that the invertible substitutions 
for which the pairs (x’, y’) run through Z x Z are precisely those with 
a,Z,y,6 € Z and determinant a6 — Gy = +1. Such substitutions are 
now called unimodular. 

The result a’x!+b'z'y!+c’y’” of a unimodular substitution a’ x’ +/’y’ 
for x and yz’ + 6'y’ for y in ax” + bry + cy? is therefore a form that 
takes the same values as az? + bry + cy”. Forms transformable into 
each other by unimodular substitutions are equivalent, as we would say, 
because the unimodular substitutions form a group. Lagrange observed 
that equivalent forms have the same discriminant 


D=08* — 4ac = 0” — 4a'c’, 


In this case 


as can be checked by computing b/? — 4a’c’ and using a6 — By = +1. 
(Incidentally, the old term for the discriminant of a quadratic form was 
determinant. I have retained this term in the translation of Dedekind’s 
memoir because he refers to a slightly different definition, due to Gauss.) 
Observing the invariance of the discriminant is a first step towards decid- 
ing equivalence of forms. To go further we need to answer the question: 
how many inequivalent forms have the same discriminant? 

Lagrange found a way to answer this question for forms with negative 
discriminant. He showed that any ax? + bay + cy? can be transformed 
into an equivalent form a’z’” + b'z'y’ + cy’? that is reduced in the sense 
that |b'| <a’ <c’. 

It follows that 


—D = 4a'd — 8 > 4a’? — a" = 30 


and therefore, in the case of negative discriminant, only finitely many 
values of the integers a’, b’ and hence c’, can occur in reduced forms. 
For any particular D < 0 it is then possible to work out the inequivalent 
reduced forms of discriminant D. The number of them is called the class 
number h(D). The first few calculations yield the following results. 
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All forms with discriminant —4 are equivalent to x? + y?, hence 
h(—4) = 1. 
All forms with discriminant —8 are equivalent to x? + 2y?, hence 
h(—8) = 1. 
All forms with discriminant —12 are equivalent to x? + 3y?, hence 
h(-12) = 1. 


But there are two inequivalent reduced forms with discriminant —20, 
the forms x? + 5y? and 2x? + 2ry + 3y”, so h(—20) = 2. Aha! There 
is something different about x? + 5y?! Perhaps the previously invisible 
companion 2z27+2zry+3y? accounts for its “two-faced” behaviour. Before 
following up this suggestion, however, let us see how Lagrange used the 
uniqueness of the form x? + y” to prove the two squares theorem. 


0.3.4 Lagrange’s proof of the two squares theorem 


Given a prime p = 4n +1, it suffices to find any form with discriminant 
—4 that represents p, since Lagrange has shown we can transform this 
form into 2? + y?. It suffices in turn to find an integer m such that 
p divides m? + 1, because in this case the form pr? + 2mzy + mt) y? 
has integer coefficients, its discriminant is —4, and of course it takes 
the value p for s = 1, y = 0. (The use of this particular form is a 
simplification of Lagrange’s argument due to Gauss (1801), article 182). 

This is where it is crucial that p be of the form 4n + 1 and prime. By 
Fermat’s little theorem, p then divides 24” — 1 = (z?" — 1)(z?" + 1) for 
any integer z relatively prime to p. Thus p will divide z2” + 1, which 
is m? + 1 with m = 2”, provided z is chosen so that p does not divide 
z7” — 1. In terms of congruences, we want to choose one of the 4n 
nonzero values of z (mod p) so that 22” — 1 is nonzero (mod p). This 
is possible by an earlier theorem of Lagrange (1770): a congruence of 
degree g modulo a prime p has at most g solutions. 

A modern proof of this theorem (not much different from Lagrange’s 
own proof) uses the fact that the nonzero integers mod p have multi- 
plicative inverses. It follows that the congruence classes mod p form a 
field, and one can show as in classical algebra that each root z; of a 
polynomial p(z) corresponds to a factor (z — z;) of p(z). Hence there 
cannot be more roots than the degree of p(z). 

It follows in the present case that we have at least 4n — 2n = 2n 
values of z which make z2” — 1 nonzero mod p, and hence p divides 
27? 41 =m? +1 as required. 
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0.3.5 Primitive roots and quadratic residues 


Lagrange’s result that an n** degree congruence mod p has at most n dif- 
ferent solutions has another important consequence in mod p arithmetic 
— the existence of primitive roots. We digress a little further here to 
discuss this concept, since it will be important later, and it also throws 
more light on the two squares theorem. 

An integer a is called a primitive root mod p if 


a?-! = 1 (mod p), 
a’ #1(modp) forl<q<p-1. 


More generally, if we define the order of element a mod p to be the least 
positive n such that a” = 1 (mod p), then a primitive root is an element 
of order p~ 1. Existence of a primitive root means that the group of 
nonzero congruence classes is cyclic. Euler conjectured that a primitive 
root exists for each p, but was unable to prove it. The first proof was 
given by Gauss (1801), article 55. Like all proofs since, the primitive root 
is not constructed explicitly — rather, its existence is shown by counting 
the number of possible solutions of a congruence. 

The quickest proof looks at elements of relatively prime order, and 
first shows that the least common multiple 1 of the orders of nonzero 
integers mod p is itself the order of some element, a. Since / is the least 
common multiple we then have 


g'=1(modp) for «=1,2,...,p—1. 


If | < p—1 this is a congruence of degree / with more than / solutions, 
contrary to Lagrange’s result. Hence in fact |! = p— 1, which means a is 


a primitive root mod p. Oo 


The existence of a primitive root a mod p means that 
{1,a,a7,...,a?-7} = {1,2,...,p—1} 


This makes many facts about multiplication mod p quite transparent. 
For example, exactly half the integers 1,2,...,p—1 are squares, mod p. 
Indeed, the even powers 1, a”, a*,... of a primitive root are squares mod 
p, and conversely. 

The traditional term for squares mod p is quadratic residues mod p. 
Quadratic residues arise naturally in the study of quadratic forms, and 
their fundamental theorem, called “quadratic reciprocity”, was conjec- 
tured by Euler. Like the existence of primitive roots (also conjectured 
by Euler), it was first proved by Gauss. We shall say more about this 
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in Section 0.5. Two important preliminaries to the general discussion of 
squares mod p are the following: 


Euler’s criterion. m is a square mod p = mr =1 (mod p) 


If a is a primitive root mod p, 
m is square mod p > m=a* for some j 
—1 . 
=> mF =a) =1 (mod p), 
m is nonsquare mod p > m=a*! for some j 


=> pe — gi(P-1)+ FS = att =-1 (mod p). 


Quadratic character of —1. The number —1 is a square mod p & 
p=4n+1 for some n. 


Notice that this statement strengthens the result used in the proof of 
the two squares theorem, that p = 4n + 1 divides some m? + 1. It also 
follows easily from the existence of primitive roots: 


—1 is a square mod p © x* =-—1 (mod p) for some x 
<= «has order 4 (mod p) 
2 f= att where a is a primitive root 


= p=4n+1 for somen. 


0.3.6 Composition of forms 
The second reduced form of discriminant —20, namely 227 + 2zy + 3y?, 
accounts for much of the anomalous behaviour of x? + 5y”. The primes 
not represented by x? + 5y”, namely those in the classes 20n + 3 and 
20n + 7, are represented by 2x? + 2xy + 3y?, for example 


3=2x074+2x0x14+3x 1?, 
7=2x174+2x1x14+3x1?, 
23 = 2 x (—2)? +2 x (—2) x 343 x 37. 
Lagrange (1773) proved this, and also Euler’s conjecture (0.3.2) that all 
primes p in the classes 20n + 1 and 20n + 9 are of the form x? + 5y?. 
He used these two theorems to establish the conjectures of Fermat and 


Euler (0.3.2) about products and doubles of primes in the classes 20n +3 
and 20n + 7, with the help of another observation: the product of two 
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numbers of the form 2a* + 2xy + 3y? is a number of the form a? + 5y?. 
This crucial observation is based on the following algebraic identity: 


(2x? + Qxy + 3y*)(20"” + 22'y' + 3y"”) = X? + 5Y? 
where 
X = 222' + ary! + yx’ —2yy’, 
Y=ay'+ys' + yy’. 


Such an identity can be checked mechanically by multiplying out both 
sides, but how did Lagrange find it in the first place? Probably by past 
experience with products of forms, some of which were known much 
earlier. We have already seen one known to Diophantus (0.2.4): 


2 2 
(a? + y?)(2" +y") = (xa! — yy’)? + (ay! + ys’)? 
It has a generalisation 
2 2 
(a? + cy?)(a"” + cy”) = (aa! — cyy’)? + c(xy! + ys’)? 
discovered by the Indian mathematician Brahmagupta around 600 AD. 
(See Colebrooke (1817), p. 363, and Weil (1984), p. 14.) 


This would have been easy for Lagrange to derive using the complex 
factors 


(x? +cy?)(a!” +cy”) = (ant yV—c)(2-yV—0)(2' +y' V—0)(2’ -y'V=0), 


and pairing the first factor with the third, and the second with the 
fourth. Lagrange’s own identity can be derived quite mechanically from 


(a? + 5y*)(a"” + 5y’”) = (wa’ — Byy’)? + 5(ay’ + yx’)? 


(Brahmagupta’s identity for c = 5), and 
: if yy? y\? 
22° + 2ry + 3y*° = 2 (e+ =) +5(3) | 


(the result of completing the square on 2x7 + 2xy + 3y”). Use the latter 
to rewrite each of 2x? + 2xy + 3y? and 2a’? + 22'y! + 3y’?, in the form 
2[X? + 5Y?], multiply them out using Brahmagupta’s identity, then 
absorb the factors of 2. 

There is a related identity for the product of the two different forms 
of discriminant —20: 


(x? + 5y?)(Qa!” + Qa’y! + By’) = 2X? 4 2XY + 3Y? 
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where 

X = a2! — yz’ — 3yy’, 

Y = ay’ + 2ya! + yy’ 
and it can be derived in a similar way. These identities show that the 
forms x? + 5y? and 2x? + 2ry + 3y? are “closed under products” in a 
certain sense. The product operation is known as composition of forms. 

Legendre (1798) managed to show, in fact, that any two quadratic 

forms with the same discriminant could be “composed” in this fashion. 
Something very interesting was going on, but what? 


0.3.7 The class group 


Composition of forms came on the scene decades before the axiomatic 
properties of abstract structures, such as groups, were considered in 
mathematics. Legendre had found a set (the forms with fixed negative 
discriminant) and a “product” operation on it (composition) but there 
was no reason to expect the operation to have simple or interesting 
structural properties. All he could was draw up “multiplication tables” 
for the forms with particular discriminants. For example, if we take the 
forms with discriminant —20, 


A=2? +5y’, 
B = 22? + Iry + 3y?, 


the table would be 


| [A Bl 
A|A B 
BiB A 
because AA = A by Brahmagupta’s identity, BB = A by Lagrange’s 
identity, and AB = BA = B by the last identity of 0.3.6. 
A more complicated example actually given by Legendre was for the 
forms of discriminant —164, rewritten by Cox (1989)) p. 42 as: 
A= x? +41y?, 
B= 22? + Qay + 21y’, 
C = 5a? + Gay + 10y’, 
D = 32? + 2ry + 14y’, 
E = 62? + 2ay + 7y?. 
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Legendre’s multiplication table for these forms can then be written: 


AorB DorE DorE 
DorE AorC BorC 
DorE BorC AorC 


To us, of course, it is rather disturbing to see AC = BC when A # B, 
and worse still that some of the products have two values. The two- 
valued entries are due to an ambiguity of sign in Legendre’s definition 
of composition. Ambiguity would be avoided in any modern attempt to 
define an operation on a set, and it was avoided in the next study of 
quadratic forms, by Gauss (1801) (the Disquisitiones Arithmeticae). 

Gauss’s analysis of forms under composition is amazingly modern in 
some ways. He defined composition unambiguously, showed that it is 
well-defined on equivalence classes of forms and showed, in effect, that 
the equivalence classes of forms constitute an abelian group under com- 
position. It is now called the class group. Gauss even came close to 
finding a decomposition of the class group into cyclic factors. 

Yet all this was accomplished with definitions and proofs so cumber- 
some it took 70 years for the rest of the world to understand them. With 
Gauss’s definition of composition, for example, it is a major problem to 
prove that composition is associative. Even the statement of associativ- 
ity in the Disquisitiones is clumsy, as if Gauss had not really grasped 
what associativity is about: 


If the form F' is composed of the forms f, f’; the form § from F and f”; the 
form F’ from f, f”; the form §’ from F’ and f’; then the forms , §’ will be 
... equivalent. (Gauss (1801), article 240.) 


It is only by using the commutativity of composition (which is mercifully 
obvious) that this statement can be rewritten in a form recognisable as 
associativity, namely 


PENAL HLF. 


The proof is monstrous. It requires the derivation of 37 equations, most 
of which Gauss leaves to the reader. This put the subject of composition 
of forms out of the reach of most mathematicians until Dirichlet and 
Dedekind simplified it enough to make associativity obvious. A polished 
exposition was eventually given by Dedekind in his Supplement X to 
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the 2nd edition of Dirichlet’s Vorlesungen tiber Zahlentheorie (Dirichlet 
(1871)). 

However, the approach via algebraic identities, no matter how slick, 
was becoming irrelevant by this time. The abstract structure of the class 
group was seen as more important, and more usable, since facts could be 
deduced from it without reference to the definition of composition. For 
example, Kronecker (1870) showed that the decomposition of the class 
group into cyclic factors follows purely from axioms for finite abelian 
groups (which he was the first to state). Dedekind’s Supplement X to 
Dirichlet (1871) was in effect the swansong of the old theory of compo- 
sition of forms, because in the same work he showed how to rebuild the 
class group on a simpler and more general basis, the theory of algebraic 
integers. The first important examples of such integers were studied by 
Lagrange and Euler. 


0.4 Quadratic integers 
0.4.1 The need for generalised “integers” 


We have already seen how convenient it is to use factorisations involving 
square roots to prove identities about integers, such as 


(2? + cy?)(a"” + cy’) = (a2! — cyy’)? + c(ay’ + y2")?. 


Lagrange (1768) and Euler (1770) began using complex or irrational 
factorisations to find integer solutions of equations. The most interesting 
and provocative example is the Euler (1770) proof of the following claim 
of Fermat (1657): 27 is the only cube that exceeds a square by 2. This 
is equivalent to saying that y = 3, x = 5 is the only positive integer 
solution of the equation 


y? = 27 +2. 


Euler’s solution is breathtaking, even if not exactly rigorous. He fac- 
torises the right hand side into (x — /—2)(z + /—2), and proceeds to 
treat x — /—2 and x — /—2 as if they were integers. He assumes they 
have gcd 1, and therefore, since their product is a cube, they are cubes 
themselves. This presumably means that 


z+V/-2=(a+b/-2)> forsome abe Z 
= a? + 3a7bv/—2 + 3ab?(—2) + 69(—2) /—2 
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and therefore 


x = a°® — 6ab’, 
1 = 3ab — 26° 


equating real and imaginary parts. But 
1 = 3a7b — 2b? = b(3a” — b?) 


only if b = +1 and a = +1, since 1 and —1 are the only integer divisors 
of 1. This gives z = 5 as the only positive integer solution for x. Q.E.D! 

Euler gave several examples of this kind, generally splitting quadratics 
into irrational complex factors and treating the factors as integers. Why 
is this permissible, if indeed it is? To answer this question we need to 
recall how ordinary integers behave, particularly as divisors. 

The behaviour of the ordinary integers is ruled by unique prime fac- 
torisation, which in turn depends on the prime divisor property: a prime 
divides a product ab only if it divides one of a, b (0.2.2). For example, 
this is crucial in proving that relatively prime integers are squares when 
their product is a square (0.2.1). To justify similar propositions about 
“quadratic integers” such as x + \/—2, we have to decide which of them 
are “primes” and then see whether they have a prime divisor property 
like the ordinary integers. This is easiest when the quadratic integers in 
question have a “Euclidean algorithm” , because one can then follow the 
trail blazed by Euclid in his proof of the prime divisor property. 

The first to carry out such a program was Gauss (1832), who studied 
the divisibility properties of the numbers z + y/—1 , where x,y € Z. 
These numbers are now known as the Gaussian integers. They are the 
simplest kind of quadratic integers, and they tie up nicely with quadratic 
forms and some other threads in our story, so it is worth looking at them 
first. 


0.4.2 Gaussian integers 


Making the usual abbreviation i for \/—1, we denote the set of Gaussian 
integers ++ yi by Z[i]. The first step towards unique prime factorisation 
in Z[t] is to show that primes exist. We can do this immediately with 
the help of the norm, a measure of size in Z]#] introduced by Gauss. The 
norm of xz + yt , N(x + yt), is defined by 


N(x+yi) = la + yil? = 2? + y?, 
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and has the multiplicative property 
N((x + yi)(z' + y't)) = N(x + yi)N(a' +y'1), 

because 

(wa! — yy’)? + (ay! + ye’)? = (2? +y°)(e” +9”), 
by Diophantus’ identity (0.2.4). It follows that any divisor x + yi of 
X + Yi has N(x + yi) < N(X + Yi) and therefore, since the norm is a 
positive integer, each Gaussian integer of norm > 1 has a divisor x + yi 
of minimal norm > 1. 

We call any such divisor a Gaussian prime because it is not the prod- 
uct of Gaussian integers of smaller norm. By repeatedly removing prime 
divisors, we obtain a factorisation of any Gaussian integer into Gaussian 
primes. Obviously these factors are determined only up to factors of 
norm 1, the so-called units 1, —1, i, —i. This is similar to the situation 
in Z, where prime factors are determined only up to the unit factors 
+1. The real problem in Z[#], as in Z, is to establish the prime divisor 
property: if a prime divides a product then it divides one of the factors. 

Euclid’s proof of the prime divisor property for Z (0.2.2) involves his 
algorithm for finding the gcd by repeated subtraction, which terminates 
because it yields a decreasing sequence of positive integers. The proof for 
Zit] is similar, but the algorithm for gcd requires repeated division with 
remainder, which terminates because it yields a sequence of remainders 
that decrease in norm. The decrease is guaranteed by the following 
division property of Zii|: for any Gaussian integers a and GB # 0 there 
are Gaussian integers 4 and p ( “quotient” and “remainder”) such that 


a=pB+p where 0< |p| < Il. 


The division property is clear as soon as one realises that the multiples 
LG of £ lie at the corners of a lattice of squares in the complex plane. A 
typical square is the one with corners 0, 6, 18, (1 +7)8. The remainder 
p is the difference between a and the nearest corner pf in the lattice, 
so 0 < |p| < |8] because the distance |p| between any point in a square 
and the nearest corner is less than the length |(| of a side. 

To find gcd(a, 8) one can therefore let a = a1, 8 = (, and repeatedly 
compute 


aj+1 = Bj, 
8341 = remainder when a; is divided by £;, 


until a zero remainder (; is obtained. (We use j,k,... for indices from 
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now on, since 7 will be reserved for /—1.) Then gced(a, 3) = (%, and 
there are Gaussian integers yp and v such that gcd(a, 3) = pa + vZ by 
an argument like that used for Z (0.2.2). In fact, the rest of the route 
to unique prime factorisation is essentially the same as in Z. We get the 
prime divisor property by arguing that 


1 = gced(z,a) = pr+va 


when 7 is a Gaussian prime not dividing a, and multiplying both sides 
by 8. Unique prime factorisation (up to unit factors) is obtained by 
supposing 

11% +++ Tr = ibe: ds 


are two prime factorisations of the same number, and cancelling 71, 72, ... 
in turn until only units remain. 


0.4.8 Gaussian primes 

Identifying the actual primes of Z[i] is a separate question, and Gauss 
observed that it is closely connected with Fermat’s two squares theorem. 
This is because N(x + yi) = x? + y’, so divisors of x + yi have norm 
dividing z? + y?. In particular, if x? + y? is an ordinary prime then 
x+yi is a Gaussian prime, because it cannot be the product of numbers 
of smaller norm. Fermat’s two squares theorem tells us that the primes 
of the form x? + y? are exactly the primes p = 4n +1 and p = 2, so each 
such x? + y? gives us two Gaussian primes, r+ yi and x — yi. Moreover, 
the factorisation 


a? +y? = (+ yi)(x— yt) 


is unique up to unit factors, by unique prime factorisation in Z|]. Hence 
there are exactly two Gaussian primes, up to unit factors, for each or- 
dinary prime p = 4n + 1. (This shows, incidentally, that the partition 
x* + y* of p into two squares is unique, a result also stated by Fermat.) 
The ordinary prime 2 is exceptional, being the square of the Gaussian 
prime 1 +7, up to a unit factor. 

Conversely, if « + yi is a Gaussian prime then so is its conjugate 
x — yi, since any factorisation of x — yi yields a factorisation of x + yt 
by conjugation. Thus 


a? +y* =(x+yi)(x— yi) 


is a Gaussian prime factorisation, and hence unique up to unit factors. 
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It follows that x? + y? is an ordinary prime, since any factorisation of it 
into ordinary primes would yield a different factorisation into Gaussian 
primes. 

Thus the “properly complex” Gaussian primes are (up to unit factors) 
the conjugate factors x + yi and z — yi of ordinary primes of the form 
z*+y?,. By Fermat’s two squares theorem these are the primes p = 4n+1 
and p = 2. The real Gaussian primes are the ordinary primes p = 4n+3, 
since these have no divisors of the form x? + y* (by Fermat’s theorem 
again) and hence no complex Gaussian prime divisors x + yi (since a 
divisor x + yt implies a divisor x — yi, by conjugation). 

The close relation between Gaussian primes and the ordinary primes 
of the form x? + y? suggests that the theory of Z[i] may be used to prove 
Fermat’s two squares theorem. Dedekind gave at least two such proofs, 
one in the memoir below (§27) and another in his final version of ideal 
theory (Dedekind (1894), §159). The latter proof is short enough to 
describe here, since it builds on the result of Lagrange that p = 4n+1 
divides a number of the form m? + 1, which we have already described 
in 0.3.4. 

Since p does not divide either of the Gaussian factors m +i, m—i 
of m? + 1 (the quotients a 3 are not Gaussian integers), p is not 
a Gaussian prime, by the prime divisor property of Z[i]. We therefore 
have a Gaussian factorisation 


p= (z+ yi)(x — yt), 
which gives 
p= x? + y’. 


This argument from unique prime factorisation replaces Lagrange’s ar- 
gument from the equivalence of quadratic forms with discriminant —4. 
In fact, Dedekind (1894), §159, goes on to deduce the equivalence of 
these forms directly from unique prime factorisation in Z[i]. 


0.4.4 Imaginary quadratic integers 
As an obvious generalisation of the Gaussian integers we can consider, 
for any positive integer c, 


Z[V—e] = {x + yV—c: 2, y € Z}, 
with norm defined by 
N(x +yV=0) = |e t+ yV—el” = 2? + cy’. 
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The multiplicative property of the norm, 
N((e + yV=0)(2! +y'V=0)) = N(@ + yV=2)N(2! +y/'V=0), 


is equivalent to the Brahmagupta identity (0.3.6) and one proves, as 
in Zi], that each element of Z[,/—c] has a factorisation into primes. 
However, the uniqueness of the prime factorisation depends on the value 
of c, as we shall see in the next section. 

After the Gaussian integers (c = 1), the case c = 2 is also of interest, 
since Z[,/—2] contains the numbers r+./—2 used by Euler in his solution 
of the equation y? = x? +2. Unique prime factorisation also holds in 
Z[/V—2]. The proof, like that in Z[#], is based on the division property: 
for any a and 3 # 0 there are wp and p in Z[/—2] (“quotient” and 
“remainder”) such that 


a=pB+p where 0< |p| < |}. 


The division property holds in Z[,/—2] for similar geometric reasons. 

The multiples 4 of G lie at the corners of a lattice of rectangles in the 

complex plane. A typical rectangle is the one with corners 0, 3, /—2(, 

(1++/—2)@. The lengths of its sides are |G| and /2|(|, so the maximum 

distance of any point from a corner (attained by the centre point) is 
3|| < |G|, as required for the division property. 

It follows that Z[,/—2] has a Euclidean algorithm, and hence unique 
prime factorisation, by the argument used for Z[i]. Since N(r+y/—2) = 
x* + 2y’, the only units in Z[,/—2] are +1, and prime factorisation is 
unique up to sign (as in Z, and hence “more unique” than in Z[i]). The 
actual primes in Z[,/—2] can be described using Fermat’s theorem (0.3.2) 
that the (ordinary) primes of the form 2? + 2y? are those of the form 
p = 8n+1 or p= 8n +3. Each ordinary prime p = 8n + 1 = 2? + 2y? 
or p = 8n+3 = x? + 2y splits into primes z + y/—2 and x — y/—2 
of Z[,/—2], and each ordinary prime p = 8n + 5 or p = 8n +7 remains 
prime in Z[/—2]. 

Finally, to justify Euler’s solution of y? = x? + 2 it remains to prove 
that 


ged(z + /—2, 2 — /—2) = 1 


and that relatively prime numbers in Z[,/—2] are cubes when their prod- 
uct is a cube. 

To see why ged(x++/—2, x—1/—2) = 1, note that any common divisor 
of r+ /—2 and z— /—2 is also a divisor of their difference 2\/—2. Also, 
any solution x of y? = x? +2 must be odd, because an even x yields 
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an even y, in which case 8 divides y? and does not divide x? + 2. Thus 
N(x + /—2) must be odd, and hence its gcd with N(2\/—2) = 8 is 1. 
But this implies that 1 is the gcd of x — /—2 and 2,/—2, and hence of 
x+/-2 and « — /—2. 

Thanks to unique prime factorisation, and the fact that the units of 
Z[/—2] are +1, the proof that relatively prime numbers are cubes when 
their product is a cube is essentially the same as in Z. This in turn is 
like the proof for squares mentioned in 0.2.1. 


0.4.5 The failure of unique prime factorisation 
It should not be a great surprise that something goes wrong in Z[{./—5], 
because the norm of z+y/—5 is the anomalous quadratic form x? +5y?. 
In short, prime factorisation is not unique. Consider the following two 
factorisations of 6: 


2x 3= (1+ V—5)(1- V—5). 


The norms of the factors are N(2) = 4, N(3) = 9, N(1+ /—5) = 6 
and N(1 — /—5) = 6. The only nontrivial divisors of these norms 
are 2 and 3, which are not norms, hence the factors 2, 3, (1+ /—5), 
(1 — /—5) themselves have no nontrivial divisors. These factors are 
therefore primes in Z[,/—5], and they certainly differ by more than unit 
factors, since the norms of the factors on the left are different from the 
norms of the factors on the right. 

So here we have another anomalous behaviour of x? + 5y*. Its ir- 
rational factors « + y/—5 and x — y/—5 belong to a set Z[,/—5] of 
“{ntegers” without unique prime factorisation. Can this be related to 
the fact that x? + 5y? is one of two inequivalent quadratic forms with 
discriminant —20? 

Indeed yes. The nonuniqueness of prime factorisation in Z[,/—5] can 
be traced to the existence of ordinary integers x? + 5y? with divisors 
not of the same form. If each number x + y/—5 had a unique prime 
factorisation in Z[/—5], 


etyV—5 = (41 +y1V—5) (x2 + yoV—5) --- (2e + yr V—5), 
then we should have the following factorisation of its norm in Z: 
a? + 5y? = (af + 5yf) (23 + 5yz) +++ (az + 5yz). 


And each factor (x? + 5y?) would be an ordinary prime or the square 
of an ordinary prime, since any further decomposition of x? + 5y? into 
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ordinary primes would yield a different prime factorisation of x +y/—5 
in Z[/—5]. Then, since the product of numbers of the form (x? + 5y?) 
is another number of the same form (by Brahmagupta’s identity), each 
divisor of z?-+5y? would also have this form. But of course 6 has divisors 
2 and 3, which are not of the form x? + 5y?. Instead, they are of the 
other form with discriminant —20, namely 2x? + 2xry + 3y?. 

Most of this could have been noticed by Lagrange, but the failure 
of unique prime factorisation in Z[,\/—5] was not explicitly pointed out 
until Kummer (1844) proposed the introduction of “ideal factors” to save 
it. Kummer (1846b) went on to define a notion of equivalence for ideal 
factors, under which ideal factor classes correspond to classes of forms 
with the same discriminant. Thus inequivalence of forms is actually the 
same thing as failure of unique prime factorisation. Kummer’s discovery 
changed the whole direction of number theory at this point, from the 
theory of forms to the theory of ideal factors. However, it would be an 
oversimplification to say that this happened just because of quadratic 
forms like x? + 5y” . A second major influence was the theory of roots 
of unity, which will be discussed in the next section. 

It is possible that number theory could have changed direction much 
earlier, if not for the conservatism of Gauss. According to Kummer 
(1846a), Gauss was aware of the failure of unique prime factorisation for 
quadratic integers when he wrote the Disquisitiones, and could see that 
something like ideal factorisation was needed. However, he was unable 
to find a rigorous description of ideal factors, and invented the theory 
of composition of forms as a substitute. He later told Dirichlet 


If I wanted to proceed with the use of imaginaries in the way that earlier 
mathematicians have done, then one of my earlier researches which is very 
difficult [composition of forms] could have been done in a very simple way. 
(Excerpt from Kummer (1846a) in Edwards (1977), p. 335.) 


Having invested so much energy in composition of forms, Gauss perhaps 
became unwilling to pursue the alternative theory of quadratic integers. 
As we know, it was only in 1832 (the paper on Gaussian integers) that 
he got as far as proving unique prime factorisation for Z[i]. 

On the other hand, it must be admitted that Gauss’s work on other 
topics — the theory of regular polygons (or circle division) and the search 
for reciprocity laws — was crucial in the development of the more general 
concept of algebraic integer. As we shall now see, this work also belongs 
to the theory of roots of unity. 
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0.5 Roots of unity 
0.5.1 Fermat’s last theorem 


The Pythagorean equation is probably the most fruitful equation in num- 
ber theory, having been the inspiration for important results of Euclid 
(0.2.1), Diophantus (0.2.4) and Fermat (0.3.1). Diophantus threw new 
light on the Pythagorean equation by considering rational solutions, in- 
stead of just integer solutions, and by showing that any nonzero square 
in Q splits into two nonzero squares. Next to this result in Diophantus’ 
Arithmetica, Fermat wrote his famous marginal note: 


It is impossible to separate a cube into two cubes, or a biquadrate into two 
biquadrates, or in general any power higher than second into powers of like 
degree; I have discovered a truly marvellous proof of this which however this 
margin is too small to contain. 


This claim of Fermat’s became known as Fermat’s last theorem in the 
early 19th century, as by then it was the only claim of Fermat’s still to 
be proved or disproved. 

Fermat’s last theorem is the claim that the equation 


a +5" =o" 


has no nonzero solution a,b,c € Q when n is an integer > 2. Since any 
rational solution becomes an integer solution when multiplied through 
by a common denominator, an equivalent claim is that a” + b” = c” 
has no nonzero integer solution. Although these two claims are logically 
equivalent, they suggest different viewpoints and different approaches 
to proving the the theorem. The approach through rational numbers, 
which stems from Diophantus, is “geometric” in a broad sense. The 
approach through integers, which stems from Fermat and Euler, is “al- 
gebraic”. The geometric approach seems to have won the day, but only 
by calling on large amounts of algebra, analysis and topology as well. 
The algebraic approach, while falling short of Fermat’s last theorem, 
was an outstanding success in the development of algebra as a whole. In 
particular, it was the main stimulus for the development of Dedekind’s 
theory of ideals. 

Fermat’s own contribution to Fermat’s last theorem was slight, as far 
as we know. The “marvellous proof” he claimed in his marginal note 
was probably based on a mistake, as he did not repeat the claim later, 
and left only the proof for n = 4 (0.3.1). This proof does not generalise 
to other values of n, and it is the only case of Fermat’s last theorem that 
has been proved by elementary methods. 
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The search for an algebraic proof of Fermat’s last theorem really begins 
with the Algebra of Euler (1770). As we have seen in 0.4.1, this book 
contains a brilliant, and basically sound, treatment of the equation y? = 
x* +2. It also contains a treatment of 23+? = z3, the Fermat equation 
for n = 3. The reasoning is not quite so sound, but it is extremely 
thought-provoking, and pregnant with possibilities for generalisation. 

Euler rewrites the equation as 


y= 2—23, 


and then factorises the right hand side into 
(z — 2)(z — G32)(z — Gar) 


where 


are the imaginary cube roots of 1. At this point Euler makes an inter- 
esting mistake. He wants to argue that the factors z—2z, z—(32, z—C2x 
are relatively prime factors of a cube, hence cubes themselves, but uses 
this argument in the wrong setting — in Z[,/—3] rather than Z[¢3]. It so 
happens that unique prime factorisation fails in Z[,/—3], as can be seen 
from the example 


4=2x2=(1+ /—3)(1- V-3), 


whose factors all have minimal norm, 4, for nonunit elements of Z[,/—3]. 
However, unique prime factorisation is valid in Z[C3], for geometric rea- 
sons like those that apply to Z[{i] and Z[,/—2], and Euler’s idea can be 
made to work. 


0.5.2 The cyclotomic integers 


The numbers Z[{¢3] in the proof of Fermat’s last theorem for n = 3 are 
examples of cyclotomic integers. The general ring of cyclotomic integers 
is Z[¢n] where 


2x |, 2 
Cn = cos — +7sin — 
n n 


is a complex n* root of 1. It was first studied by Gauss (1801), for 
geometric reasons. The points ¢,,¢2,...,¢" = 1 are equally spaced 


0.5.2 The cyclotomic integers 31 


around the unit circle, hence the name “cyclotomic” , from the Greek for 
“circle-dividing”. Since 


Gel =n - WG tetGn +1) and G #1 
it follows that ¢, satisfies the equation 
mote t Gn t+1=0, 
and hence Z[¢,| can be described explicitly as 
ZlGn] = {a0 + @16n +°- + Gn-267? : a9, a1, -..,@n—2 € Z}. 


When n is a prime, 2”~! + --- + z +1 = 0 is called the cyclotomic 
equation. Gauss (1801), article 341, proved that it is the equation of 
minimal degree, with integer coefficients, satisfied by ¢,. When n is not 
prime the cyclotomic equation is of degree lower than n — 1, and is not 
so easy to describe. 

Gauss was initially interested in Z[¢,,] because of its bearing on the 
construction of the regular n-gon by straightedge and compass, which is 
possible when ¢,, is expressible in terms of rational operations and square 
roots. He made the amazing discovery that this occurs for precisely the 
n that are products of a power of 2 with distinct primes of the form 
92” +1. The only known primes of this form are 3, 5, 17, 257, 65537, 
but the latter three yield regular n-gons not previously constructed. 
In fact, Gauss’s construction of the regular 17-gon was the first such 
construction since ancient times, and was an important factor in his 
decision to become a mathematician. 

Perhaps the most surprising feature of this discovery is that primes 
22" +1 had already been considered — for an entirely different reason 
— by Fermat. Fermat mistakenly believed the numbers 22” +1 to be 
prime for all values of h (Fermat (1640a)). This is false. Euler found 
that 641 divides 2?” + 1, and since then no more such primes have been 
found. Yet in a way Fermat was not far wrong — what is true is that no 
two numbers 22” + 1 have a common prime factor. This was observed 
by Pélya and Szegé (1924), and used by them to give a new proof that 
there are infinitely many primes. The numbers 22" +1 are now called 
Fermat numbers, and the primes among them are called Fermat primes. 

Gauss’s discovery may have closed a chapter in the history of geom- 
etry, but it opened a new one in the history of number theory. Quite 
apart from drawing attention to the problem of finding all Fermat primes 
(intractable so far), the theory of cyclotomic integers created new links 
between different parts of number theory. In particular, Gauss found 
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connections between arithmetic modulo p, the quadratic integers Z[,/£p] 
and the cyclotomic integers Z[¢,]. As we shall explain in the next sec- 
tion, the connection between cyclotomic integers and quadratic integers 
came to light because a solution of the cyclotomic equation by square 
roots is needed to construct the regular p-gon. 

Cyclotomic integers also arise naturally in connection with Fermat’s 
last theorem, because of the factorisation 


2" — a" = (z—a)(z— Cnt) (2- Gta), 


which generalises the factorisation used by Euler in his attempt to prove 
Fermat’s last theorem for n = 3. In fact, Lamé (1847) used this formula 
in an attempt to prove Fermat’s last theorem for arbitrary n > 2. He as- 
sumed, as would follow from unique prime factorisation, that relatively 
prime factors of an n‘* power are themselves n‘ powers. Unfortunately, 
he failed to check uniqueness of prime factorisation in Z[¢,], and this 
omission turned out to be fatal. Unique prime factorisation is crucial, 
and it fails in Z[¢23]. Unbeknownst to Lamé, Kummer (1844) had al- 
ready discovered this, and had taken steps to deal with the problem. 
However, this is getting ahead of our story. Let us return to Gauss. 
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Since a straightedge creates lines (which have equations of degree 1) and 
the compass creates circles (which have equations of degree 2), construc- 
tions find intersections of lines and circles, hence they solve linear and 
quadratic equations, and this can be done by rational operations and 
square roots. Thus if the regular p-gon is constructible by straightedge 
and compass, there is necessarily a solution of the cyclotomic equation 


23 4...+24+1=0 


by rational operations and square roots. The roots are Cp, C hs oh, 
hence the equation says that the sum of all the roots is —1. Gauss solves 
the equation by stepwise evaluation of sums of progressively smaller 
subsets of the roots, now known as Gauss sums. In the case p—1 = 22” 
each subset is half the size of the previous one, and its sum is obtainable 
from the preceding Gauss sum by square roots. Thus in 2" steps one 
reaches a single root of the cyclotomic equation, expressible by square 
roots alone. 
For example a root ¢ of the cyclotomic equation for p = 5 satisfies 


C+O+C4+O4+1=0, 
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which can rewritten 
(8 4+¢7) + (8 +¢7)-1=0, 


since (C3 + ¢7)? = ¢4 + 2¢5+¢ and ¢> = 1. Thus we have a quadratic 
equation for ¢? + ¢?, which can be solved by square roots, and we can 
then find ¢° and ¢? individually by solving quadratic equations, since 
we know their sum and product. 

Of course, if p — 1 is not a power of 2 one cannot repeatedly halve 
the number of terms in the Gauss sums until only one term remains. 
However, in the nontrivial case where p is odd, the first halving is always 
possible. From now on we shall use ¢, without subscript, to denote an 
arbitrary root of the cyclotomic equation. The set of all roots ¢* is 
partitioned into those for which k is a square mod p, and those for 
which k is a nonsquare. As we know from the existence of primitive 
roots (0.3.5), these sets are of equal size. In fact, we know that the 
squares mod p are the even powers of a primitive root mod p, and the 
nonsquares are its odd powers. In the case p = 5, 2 is a primitive root, 
so its odd powers are 2 and 23 = 8 = 3 (mod 5), and the corresponding 
Gauss sum is the one used above: ¢3 + ¢?. We could also have used its 
complement, ¢4 + ¢, whose exponents are the squares mod 5. 

In the Disquisitiones, article 356, Gauss showed that the sums whose 
exponents are the squares and nonsquares mod p are the two roots of 
the equation 

Pepe Pet a4, 
4 
with the — sign when p = 1 (mod 4) and the + sign when p = 3 (mod 
4). Since the roots are —} + ve in the former case and -4 + iv? in the 
latter, it follows that, if a is a primitive root mod p, 


yee a yon =+,/p when p=1 (mod 4), 
j j 
: ai g?itt 4 

xs -y¢ =+i,/p when p=3 (mod 4). 
F : 


] 


The left hand sides of these equations can be written more concisely with 
the help of the Legendre symbol or quadratic character symbol defined 


by 
(=) 7 { +1. if k is a square mod p 


D -—1 if k is a nonsquare mod p. 
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Quadratic characters have the following multiplicative property, 


G) G)-G). 


which follows immediately from the fact that the squares mod p are 
precisely the even powers of a primitive root. 

In terms of quadratic characters, the Gauss sum on the left of the 
equations is simply S = ike (4) ¢*, and Gauss’s theorem is equivalent 
to: 

e={ +p if p=1 (mod 4) 
—p if p=3 (mod 4). 


Even more concisely, 


since we know from 0.3.5 that —1 is a square mod p if and only if p=1 
(mod 4). 
The proof that S$? = (3) p goes as follows. Since 


a 6 
S? ~~ (£) (;) k+l 
»& Pp Pp ¢ 
es = kl k+l 
= 2s (5) 


by the multiplicative property of quadratic characters. 

Now each k # 0 (mod p) has a multiplicative inverse mod p (0.3.1), 
hence as k runs through the nonzero congruence classes mod p, so does 
kl, for fixed | 4 0 (mod p). We may therefore replace k by ki, and 


= 3 (=) cut 
Pp 


k,l=1 
p-l 

=> (2) cl(k+1) 
Risin 


since k is a square mod p © ki? is a square mod p 
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EC) -E6) Eom) 
(5) EC) vcore 


since [(k + 1) for fixed k < p—1 runs through the 


honzero congruence classes 


-(*)o-» >» (2) 


by the cyclotomic equation 
p-1 

=a) Goa 
(PS eae se ( 2 ea 

(2) e-» po Ge 
= e-*) 

(F) 

since half the (4) are +1 and half are —1, by 0.3.5 


= (=) p since p—1=-1 (mod p). 


It follows that S = + (=p, and hence Z[¢p] contains either \/p or 
./—p. Thus Gauss’s computation forges links between squares mod p, 
Z[¢p] and Z[,/£p]. In the next section we shall how Gauss exploited 
these links to give a proof of quadratic reciprocity. 


Remark. The relationship of cyclotomic integers with quadratic inte- 
gers is special, and there is no comparable relationship with, say, cubic 
integers. For example, ¥/2 does not belong to any Z[¢,]. One way 
to see this is to extend Z[¢,] to the field Q(¢,), by allowing arbitrary 
quotients, and to consider automorphisms of fields. Automorphisms of 
Q(¢p) extend functions of the form 7;(¢p) = ad and hence any two of 
them commute. On the other hand, any field containing /2 and (3 
has noncommuting automorphisms, extending all permutations of the 
set {¥/2, C3 72, (2 7/2}. We shall not elaborate on these hints, since the 
ideas belong to Galois theory, which Dedekind wishes to avoid. However, 
they do show that Galois theory is just over the horizon. 
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0.5.4 Quadratic reciprocity 


The quadratic reciprocity law is the following symmetric relationship 
between the quadratic characters of two odd primes _p, g. 


P\(q\ _f -1 if p,q=3 (mod 4) 
aq) \p/ | +1. otherwise. 


This surprising relationship can be extracted from the g‘® power of the 
Gauss sum S$ = <?7} (4) ¢* by using the “mod q binomial theorem” 


(e+y)? =a2%+y!% (mod gq). 


As mentioned in 0.3.1, this holds for g prime because in that case all the 
binomial coefficients (7),..., ( oa) are divisible by g. This leads to the 
“mod gq multinomial theorem” 


(ay +ag+-+++ap)? = zi +arp+---+2f% (mod g) 


by induction on k, when 21, %2,...,2, are indeterminates. We are able 
to treat the powers ¢,¢?,...,¢?—! as indeterminates because the mini- 
mality of the cyclotomic polynomial (0.5.2) means that each element of 
Z[¢] is uniquely expressible as a sum yee a,C*. Here is the gt power 
calculation: 


p-l k 
= (=) ck (mod q) 
k= \P 
p-1 k a 
= Pa (=) cra since (*) — (4) for any odd power a 
p-1 2 : 
= (2) (=) cP since (2) =] 
pai \P 4 
p-1 
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since, for fixed g, kq runs through the nonzero congruence classes. We 
now multiply both sides by S and use S? = (+) S, obtaining 


oe (2) (Zs mn 


The left hand side is 


(Ss?) = (=) r pe 
p 


(2) (8) on 


Finally, Euler’s criterion (0.3.5) allows us to replace pir by the quadratic 
character (2), and since (+) = +1 we have 


(2)=(2) « orem 
(2)(0)-(@) « aesiento 


from which the statement of quadratic reciprocity follows, using the 
value of (}), the quadratic character of —1, known from 0.3.5. 


Remarks. The law of quadratic reciprocity has probably been proved in 
more ways than any other theorem in mathematics except the theorem 
of Pythagoras. Gauss himself gave eight proofs, of which the above is the 
sixth (Gauss (1818)). There are shorter or more elementary proofs, but 
the one using Gauss sums is especially enlightening, because of the way 
it relates squares mod p to quadratic integers and cyclotomic integers. 
It was certainly an inspiration to Dedekind, and his own proof (§27 of 
the memoir) shows how Gauss’s ideas can be transformed with the help 
of ideals. Dedekind says he is giving the sixth proof of Gauss, but he 
succeeds in eliminating all the computations. This is exactly what he 
intends to accomplish with the theory of ideals. As he says in §12: 


It is preferable, as in the modern theory of functions, to seek proofs based 
immediately on fundamental characteristics, rather than on calculation, and 
indeed to construct the theory in such a way that it is able to predict the 
results of calculation. 
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0.5.5 Other reciprocity laws 


The appearance of roots of unity in the proof of quadratic reciprocity 
is very interesting and historically important, for the reasons just men- 
tioned, but it is not strictly necessary. There are many other proofs 
involving just arithmetic mod p and g. One reason it is easy to avoid 


complex numbers is that the real numbers +1 and —1 can serve as values 


of the quadratic character (£). 


If, on the other hand, one wants to define a multiplicative “cubic 
character” (s , with three different values according as k is of the 


form a3", a3"+! or a3"+? (where a is a primitive root mod p), then real 
values of the character will not work. In fact, the obvious values are 


k 1 if k=at 
( =) = 3 if k= qi"! 
3 


G if k =a%"+?, 
Similarly, the obvious values of the “biquadratic character” (4), are 
1,i,-—1 and —i. Thus there is a more pressing reason to use roots of 
unity when one seeks reciprocity laws for 3° and 4*" powers. The latter 
was in fact what Gauss was doing when he developed the theory of 
unique prime factorisation in Z[i]. Likewise, Eisenstein (1844) found a 
cubic reciprocity law through investigation of Z[¢3]. 

Kummer even declared that cyclotomic integers were more important 
for their application to higher reciprocity laws than their application to 
Fermat’s last theorem. In 1847 he said: 


The Fermat theorem is indeed more of a curiosity than a main point in the 
science [of number theory]. (Kummer (1847b).) 


And a few years later, when he had made progress on reciprocity laws: 


Through my investigations in the theory of complex numbers [cyclotomic in- 
tegers] and their applications to the proof of the Fermat theorem, which I 
have communicated to the Academy of Sciences over the past three years, I 
have succeeded in discovering the general reciprocity law for arbitrary power 
residues, which in the present state of number theory stands as the main 
problem and pinnacle of the discipline. (Kummer (1850b).) 


However, he said this when he was still flushed with success over reci- 
procity — and when his work on Fermat’s last theorem was running out 
of steam. In the long run not everyone agreed with him. When Hilbert 
surveyed algebraic number theory in his influential Zahlbericht (Hilbert 
(1897)), he gave pride of place to Kummer’s results on Fermat’s last 
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theorem. Even today, when Kummer’s approach to Fermat’s last theo- 
rem has been abandoned, its ideas are at the core of algebraic number 


theory. 


0.6 Algebraic integers 
0.6.1 Definition 
The examples of quadratic and cyclotomic integers hint at a general 
concept of algebraic integer, but it is not clear how it should be defined. 
In fact, it is not even clear how quadratic integers should be defined in 
general. An example which underlines the subtlety of the concept is 


Z[V-3] = {m+nV/-3: m,n € Z}. 


As already mentioned in 0.5.1, unique prime factorisation fails in Z[,/—3], 
but it is valid in Z[¢3], where ¢3 = =hiys3 Thus we would be happier if 
all members of Z[¢3] were classed as integers, and not just the members 
of Z[/—3]. At the same time, of course, we do not want to admit all 
the members of 


Q{V—3] = {s +tV—3: s,t € Qh. 


This would certainly be going too far, since all rational numbers would 
be included. 

A definition which draws the line at just the right place is the one 
given by Dedekind in the introduction to his memoir. He states it in 
a way that makes it a natural specialisation of the concept of algebraic 
number: 

A number @ is called an algebraic number if it satisfies an equation 
0” +.a10"~* + 420"? +--+ + @n-10 + an = 0 
with finite degree n and rational coefficients a1, a2,...,@n—1,@n. It is called an 


algebraic integer, or simply an integer, when it satisfies an equation of the form 
above in which all the coefficients a1, a2,...,@n—1,@n are rational integers. 


This definition has some desirable properties, as we shall see in the next 
section, but it also has one undesirable property: if a is an algebraic 
integer then so is fa, hence every algebraic integer has a nontrivial 
factorisation a = ,/a,/a. This is very different from the behaviour of 
ordinary integers, and the theory of all algebraic integers is therefore 
not very useful in the study of Z, for the reasons given in 0.1. 
Dedekind escaped this situation by restricting attention to the integers 
in a field of finite degree, which can be most simply defined as the closure 


40 0.6 Algebraic integers 


Q(a) of QU {a} under rational operations, where @ is an algebraic 
number. The integers of Q(a) are simply the algebraic integers in Q(a). 
When a is an algebraic integer they include all members of Z[a] but 
sometimes more. For example, the set of integers in Q(./—3) is not 
Z{/—3] but Z[C3]. On the other hand, the set of integers of Q(¢,) is 
indeed Z[¢,]. Kummer, working without a general definition of algebraic 
integer, actually defined his integers to be the members of Z[¢,], so in a 
sense he was lucky to hit on the “correct” integers. 


0.6.2 Basic properties 
Dedekind first stated his definition of algebraic integer in his Supplement 
X, §160, to the second edition of Dirichlet’s Vorlesungen tiber Zahlen- 
theorie (Dedekind (1871)). There he added that: 


It follows immediately that a rational number is an algebraic integer if and 
only if it is a rational integer. 


This result goes back at least as far as Gauss. In article 11 of the 
Disquisitiones he assumes the reader knows that a rational solution of a 
monic polynomial equation with integer coefficients is itself an integer. 

It follows that an ordinary integer a does not divide an ordinary integer 
b, as an algebraic integer, unless a divides b as an ordinary integer. 
This is obviously crucial when results about ordinary integers are being 
derived as special cases of results about algebraic integers. Similarly 
crucial properties of algebraic integers are closure under +, — and x. 
These follow from another property of monic polynomial equations, first 
pointed out by Eisenstein (1850). 

If f(z) = 0 is a monic polynomial equation with ordinary integer co- 
efficients and roots a1, Q2,..-,Qn, and if g(ai,...Qn) is any polynomial 
in the roots with ordinary integer coefficients, then g(a1,.-..Qn) is also 
a root of a monic polynomial equation h(x) = 0 with ordinary integer 
coefficients. 

Eisenstein’s proof uses the theorem of Newton (1665) that symmetric 
polynomials in the roots are polynomial functions of the coefficients, 
applying it to the equation 
h(x) = Il (x — g(Qo(1); +++ @o(n))) = 0. 
all permutations o of 1,2,...,.n 
Obviously h(x) is monic, with symmetric coefficients, hence it actu- 
ally has integer coefficients, and its roots include g(a1,...,@n). Taking 
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g(@1, @2) = a1 +Q2, @ — ag Or A142 shows that a1 + a2, a1 — a2, A102 
all satisfy monic equations with ordinary integer coefficients, and hence 
they are algebraic integers. 

Dedekind remarks in §17 of the memoir that he wishes to avoid the 
theory of symmetric functions. In fact in §13 (and previously in §160 
of his Supplement X to Dirichlet (1871)) he has already derived clo- 
sure from the fact that a system of homogeneous linear equations with 
nonzero solution must have nonzero determinant. These proofs, along 
with the concepts of basis and independence he also introduced, were 
important in the development of linear algebra as a self-contained disci- 
pline. Setting the theory of algebraic integers within the theory of fields 
of finite degree gave this development an extra push, because these fields 
are vector spaces of finite dimension over the rationals. The reason is 
simple: if the minimal rational polynomial equation satisfied by a is of 
degree n, then 1,a,a”,...,a"—! is a basis for Q(a) over Q. 

Fields of finite degree are important to the concept of algebraic inte- 
ger because they admit a suitable concept of norm. Again generalising 
the situation for quadratic and cyclotomic integers, Dedekind defines 
the norm N(w) of any number w € Q(a). If a,a1,@2,...,Qn—1 are 
the roots of the irreducible monic equation satisfied by a, and ¢(a) 
is the rational function of a equal to w, then N(w) is the product 
of ¢(a), d(a1), b(@2),..-,(An-1), the so-called conjugates of w. As 
Dedekind explains in §16 and §17 of the memoir, the conjugates of w are 
also its images under all the isomorphisms of Q(a) onto other number 
fields. The latter fact makes it easy to see the multiplicative property of 
the norm: N(wiwe) = N(wi)N(we). It turns out that N(w) is always 
a rational number, and N(w) is a rational integer when w is an integer 
of Q(a). This means that every integer has a divisor of minimal norm, 
and hence a factorisation into “primes” (though the factorisation is not 
necessarily unique). 


0.6.3 Class numbers 


As already mentioned in 0.3.3, Lagrange (1773) gave the first rigorous 
results bearing on unique prime factorisation of algebraic integers. His 
reduction process for quadratic forms gives an easy way to find the 
class number of quadratic forms with negative discriminant, because the 
reduced forms are inequivalent, apart from some simple exceptions. The 
class number of x? — cy? can be redefined and reinterpreted as the class 
number of the field Q(,/c) (and generalised to any field of finite degree, 
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see §29 of the monograph). Lagrange’s computations were extended by 
Gauss (1801), who showed that the class number of Q(,/c) is 1 for c = 
—1, -2, -3, -7, -11, —19, —43, —67, —163 (Disquisitiones, article 303). 
He also conjectured that these are the only negative c for which Q(,/c) 
has class number 1, a conjecture which remained open until 1966, when 
it was proved by Baker and Stark. (See Baker (1966) and Stark (1967).) 
An equivalent statement is that these are the only quadratic fields with 
negative discriminant and unique prime factorisation. 

The situation is more difficult for quadratic fields with positive dis- 
criminant. There is a similar reduction process, also due to Lagrange, 
which shows there are only finitely many reduced forms, but it is no 
longer clear which of them are equivalent. Thus we do not immediately 
know the class number, only that it is finite. Gauss made some progress, 
giving an algorithm in article 195 of the Disquisitiones to decide equiv- 
alence of reduced forms. Dirichlet (1839) made even greater progress, 
using analysis to discover a formula for the class number of quadratic 
fields. While analytic methods are beyond our scope, Dirichlet’s result 
does indicate the depth of the problem. No more elementary approach 
has yet been found, and his related theorem on primes in arithmetic 
progressions (Dirichlet (1837)) has likewise not been proved in a more 
elementary manner. 

Apparently, Dirichlet also found an analytic formula for the class 
number of the cyclotomic field Q(¢,). Kummer (1847a) mentions this, 
while giving a direct proof that the class number of Q(¢,) is finite. 
He gives the following specific values for which the class number is 1: 
p = 5,7,11,13,17,19. Thus unique prime factorisation holds in these 
fields, and the first failure is in Q(¢23), which Kummer shows to have 
class number 3. Dirichlet never published his class number formula for 
Q(¢,), presumably deferring to Kummer, and Kummer (1850a) gave his 
own version. 

The finiteness of the class number for all fields of finite degree was 
first proved by Dedekind (1871), in his first account of ideal theory. The 
proof is similar to the one given in the memoir below. 


0.6.4 Ideal numbers and ideals 


The long story of unique prime factorisation, lost and regained, can now 
be summarised as follows. Euclid discovered unique prime factorisation 
in Z, to all intents and purposes, when he showed that a prime divides a 
product only if it divides one of the factors. Fermat saw trouble coming 
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in the behaviour of x? + 5y?, Lagrange found what the trouble was 
(class number greater than 1), and Gauss found an elaborate way round 
it (composition of forms). However, Gauss’s way out was a retreat from 
the direction already opened by Lagrange — the use of algebraic integers 
in the study of Z. 

The first to embrace the algebraic integers, for all their faults, was 
Kummer. He could see that unique prime factorisation was lost, but he 
hoped to regain it: 


It is greatly to be lamented that this virtue of the real numbers |i.e. the rational 
integers], to be decomposable into prime factors, always the same ones for a 
given number, does not also belong to the complex numbers |i.e. the integers of 
cyclotomic fields]; were this the case, the whole theory, which is still laboring 
under such difficulties, could easily be brought to its conclusion. For this 
reason, the complex numbers we have been considering seem imperfect, and 
one may well ask whether one ought not to look for another kind which would 
preserve the analogy with the real numbers with respect to such a fundamental 
property. (Translation by Weil (1975) from Kummer (1844).) 


The first sentence is the one quoted by Dedekind in Latin in his in- 
troduction: “Maxime dolendum videtur ...” (Kummer’s paper was one 
of the last important mathematical works written in Latin.) Dedekind 
goes on to say: 


But the more hopeless one feels about the prospects of later research on 
such numerical domains, the more one has to admire the steadfast efforts of 
Kummer, which were finally rewarded by a truly great and fruitful discovery. 


This was the discovery of ideal numbers. The rest of the story can be 
left to Dedekind himself. He explains Kummer’s ideal numbers with 
admirable clarity, and his own distillation of the concept of ideal from 
them. He also gives a candid account of the difficulties en route to unique 
prime factorisation of prime ideals. The concept of ideal is very simple, 
and so are the concepts of divisor and product, but unfortunately it is 
not clear that if ideal 6 divides ideal a then a = be for some ideal c. 
Dedekind was stymied by this difficulty for a long time, and here he 
explains how he overcame it after several years of struggle. The memoir 
is not only a superb exposition, but also a rare opportunity to see a great 
mathematician wrestling with a problem, with blow by blow comments 
up to the final victory. 


44 0.7 The reception of ideal theory 
0.7 The reception of ideal theory 


0.7.1 How the memoir came to be written 


Dedekind’s first exposition of ideal theory, Dedekind (1871), included 
proofs of unique prime ideal factorisation and finiteness of the class 
number, together with a very impressive application of the theory: a 
correspondence between the quadratic forms of discriminant D and the 
ideals of Q(\/D), under which the product of ideals corresponds to com- 
position of forms. Replacement of the complicated and mysterious forms 
by objects that behaved like integers should have been a revelation, but 
Dedekind’s contemporaries were slow to appreciate his achievement. 

There was great resistance to the idea of treating infinite sets as math- 
ematical objects. Dedekind had tried this in 1857 when he introduced 
congruence classes in place of specific residues, and again in 1872 with 
his definition of real numbers as Dedekind sections. But he was fighting 
2000 years of tradition (plus a formidable modern opponent, Leopold 
Kronecker). The “horror of infinity” that had haunted mathematics 
since Zeno was not to be dispelled overnight. Most mathematicians 
were not even willing to consider a theory based on infinite sets, let 
alone appreciate its power or elegance. 

Nevertheless, Dedekind tried again. The memoir on algebraic integers 
first appeared in instalments in the Bulletin des sciences mathématiques 
et astronomiques in 1877. The reason Dedekind chose this unusual outlet 
was a letter he received from Rudolf Lipschitz in 1876: 


Bonn, 11 March 1876 
Dear Colleague, 


To make the purpose of these lines clear, it is necessary to mention a few 
things first. You have perhaps seen that Darboux’ Bulletin has brought out 
a series of my works on the theory of homogeneous functions of an arbitrary 
number of differentials. Herr Darboux invited me to write such a series in 
1872, leaving the language up to me. The second editor of the Bulletin, Herr 
Hoiiel, who also translated Dirichlet’s German works for Liouville’s Journal, 
made a careful French translation, and since Herr Hoiiel has always been very 
attentive and obliging, I have remained in touch with him since. As I recently 
had cause to write to him, I brought up an idea I have long cherished, namely 
concerning your own investigations in the second edition of Dirichlet’s lectures 
on number theory, from §159 onward. In my opinion they are of rare value, 
and in Germany itself they have not received their full due. Therefore, it 
would be highly desirable if you could arrange to give a detailed analysis of 
these investigations in the Bulletin. Herr Hoiiel has now sent a reply, which 
I can best convey to you in his own words, and that is the purpose of my 
letter. He writes: [French text follows, a translation of which is] Our bulletin 
would receive with great pleasure a detailed analysis of M. Dedekind’s work. 


0.7.2 Later development of ideal theory 45 


I have communicated the substance of your letter to M. Darboux, who will 
himself write to M. Dedekind, and who will in any case be happy to publish 
his work. If you have occasion to write to the learned professor, you may tell 
him how flattered we will be by his collaboration, and thank him in advance 
for anything he is disposed to send us. [End of French text.] 

Of course, I cannot know whether you are inclined to undertake such a 
work. But I hope you will understand the true motive for the steps I have 
taken, namely, the keen desire for the mathematical public to learn the true 
value of first rate research. 

With cordiality and deep respect, 


Yours, 


R. Lipschitz 
(Translation of Lipschitz (1986), pp. 47-48.) 


On April 29 Dedekind wrote a long reply, the first part of which is: 
Dear Colleague, 


Your letter brought me great and unexpected joy, since for years I had more 
or less given up hope of interesting anybody in my general theory of ideals. 
With the exception of Professor H. Weber in Kénigsberg, with whom I have 
worked closely as editor of the forthcoming collected works of Riemann, and 
who has expressed his intention to acquaint himself with this theory, you are 
the first, not merely to show interest in the subject, but also in such a practical 
way, that it revives hope that my work may not have been in vain. I thought 
that the inclusion of this investigation in Dirichlet’s Zahlentheorie would be 
the best way to attract a wider circle of mathematicians to the field, but 
little by little I have become convinced that the presentation itself is to blame 
for the failure of this plan. I can only suppose that the presentation deterred 
readers through excessive brevity and terseness, and since autumn I have been 
spending my free time, obtained by resigning my three year directorship of the 
local polytechnic, working out a more detailed presentation of the theory of 
ideals, and I have come so far as to obtain a somewhat improved form of the 
essential foundation (the content of §163). (Translation of Dedekind’s letter 
to Lipschitz, Lipschitz (1986), pp. 48-49.) 


Correspondence about the series of articles continued until 12 August 
1876, as the details were worked out and Hoiiel was engaged to trans- 
late Dedekind’s manuscript into French. After their publication in the 
Bulletin, the articles were also published as a book (Dedekind (1877)). 


0.7.2 Later development of ideal theory 


Dedekind continued to present new versions of his ideal theory, revising 
it substantially for the third (1879) and fourth (1894) editions of Dirich- 
let’s Zahlentheorie, and expanding its interaction with Galois theory. He 
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and Weber also made a major application of ideal theory outside number 
theory, with their arithmetic theory of Riemann surfaces (Dedekind and 
Weber (1882)). This is the great fruit of their collaboration in the publi- 
cation of Riemann’s works — not only a turning point in the development 
of complex function theory, but also the beginning of modern algebraic 
geometry. (The “modern theory of functions” cited by Dedekind in §12 
of the memoir as the inspiration for proofs “based on fundamental char- 
acteristics, rather than on calculation,” is undoubtedly another reference 
to Riemann’s work.) Dedekind and Weber developed an ideal theory for 
functions, with polynomials playing the role of “rational” integers, and 
entire algebraic functions playing the role of “algebraic” integers. Once 
again, prime ideals are important, and unique prime factorisation is valid 
(in fact, somewhat easier to prove). 

In the same year, Kronecker (1882) published ‘a long account of his 
own theory of fields and integers, which he had been developing for some 
decades. His theory included an equivalent approach to unique prime 
factorisation, though expressed in a very different, and less readable, 
language. Kronecker was almost diametrically opposite to Dedekind in 
mathematical philosophy; he opposed nonconstructive methods, infinity, 
and in particular the use of infinite sets as mathematical objects. He 
even claimed not to believe in irrational numbers. Dedekind favoured 
the free use of infinite sets wherever it was necessary or convenient. His 
definitions of irrational numbers (as pairs of sets of rationals) and “ideal 
numbers” (as the sets he called ideals) should have made the convenience 
of his position clear, but in fact neither the Kronecker nor the Dedekind 
approach attracted an immediate following. 

As Artin (1962) said: 

Dedekind’s presentation is easy to read and elegant for us today, but at the 
time it was too modern. Thus the appearance of Hilbert’s Zahibericht in 
the Jahresbericht der Deutschen Mathematikervereinigung in 1897 was greeted 
with great joy. Hilbert presented all the results known up to that time and, 
through great simplifications, made the results of Kummer accessible to a 
wider circle of readers. (Translation of Artin (1962), p. 549.) 


The Zahlbericht, as its name suggests, was intended to be a report on 
number theory. Hilbert used groups, rings, fields and modules heavily, 
but with the clear aim of elucidating properties of numbers. In partic- 
ular, he used ideal theory to simplify Kummer’s results on the Fermat 
problem. 

It was only in the 1920s that Artin and Emmy Noether created mod- 
ern ideal theory by abstracting the properties of rings that make unique 


Acknowledgements 47 


factorisation into prime ideals possible, and using these properties as 
axioms. The subject then became part of ring theory, and Dedekind’s 
theories of algebraic numbers and algebraic functions became mere spe- 
cial cases, at least in theory. In practice, one still finds the best of both 
worlds in the theory of algebraic numbers. 
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Part two 
Theory of algebraic integers 


Introduction 


In response to the invitation it has been my honour to receive, I propose, 
in the present memoir, to develop the fundamental principles of the 
general theory of algebraic integers I published in the second edition of 
Dirichlet’s Vorlesungen tiber Zahlentheorie. Because of the extraordinary 
scope of this field of mathematical research, however, I restrict myself 
here to the pursuit of a single goal, which I shall try to clarify in the 
remarks that follow. 

The theory of divisibility of numbers, which is the basis of arithmetic, 
was established in its essentials by Euclid. At any rate, the fundamental 
theorem that each integer is uniquely decomposable into a product of 
primes is an immediate consequence of the theorem, proved by Euclid,f 
that a product of two numbers is not divisible by a prime unless the 
prime divides one of the factors. 

Two thousand years later, Gauss gave, for the first time, an extension 
of the notion of integer. The numbers 0,+1,+2,... previously going 
under that name, and which I shall call rational integers from now on, 
were extended when Gauss introduced{t complex integers of the form 
a+b/—1, where a and b are any rational integers. He showed that the 
general laws of divisibility of these numbers are identical with those that 
regulate the domain of rational integers. 

The broadest generalisation of the notion of integer is the following. 
A number @ is called an algebraic number if it satisfies an equation 


6” +.a,0"—! 4 a2"? +--+ + an_19 +n = 0, 
with finite degree n and rational coefficients a1, a2,...,@n-1,4n. It is 


t Elements, VII, 32. 
{ Theoria residuorum biquadraticorum, II; 1832. 
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called an algebraic integer, or simply an integer, when it satisfies an equa- 
tion of the form above in which all the coefficients a, a2,...,Q@n—1,@n 
are rational integers. It follows immediately from this definition that the 
sum, difference and product of integers are also integers. Consequently, 
an integer a will be said to be divisible by an integer ( if a = By, where 
+ is likewise an integer. An integer ¢€ will be called a unit when every 
integer is divisible by «. By analogy, a prime must be an integer a which 
is not a unit and which is divisible only by units € and products of the 
form ew. However, it is easy to see that, in the domain of all integers 
we are considering at present, primes do not exist, since every integer 
which is not a unit is always the product of two, or rather any number, 
of integral factors which are not units. 

Nevertheless, the existence of primes and the analogy with the do- 
mains of rational or complex integers re-emerges when we restrict our- 
selves to an infinitely small part of the domain of all integers, in the 
following manner. If @ is an algebraic number there is, among the in- 
finitely many equations with rational coefficients satisfied by 0, exactly 
one 


6g” +a,9"-} + les + an-194+ Gn = 0, 


of minimal degree, and which we call for this reason irreducible. If xo, 
21, 22, ..., Ln—1 denote arbitrary rational numbers, the numbers of the 


form 
(9) =2o+210+ x26" a In—10"—}, 


the set of which we call 2, will also be algebraic numbers, and they enjoy 
the fundamental property that their sums, differences, products and 
quotients also belong to the set 2. I call such a set 2 a field of degree} n. 
The numbers ¢(6) belonging to a field 9 are now partitioned, following 
the definition above, into two large sets: the set o of integers and the 
nonintegral, or fractional, numbers. The problem we set ourselves is to 
establish the general laws of division that govern such a system o. 

The system o is evidently identical with the system of all rational 
integers when n = 1, or with the complex integers when n = 2 and 
6 = /—I. Certain phenomena which occur in these two special domains 
e occur again in every domain o of this nature. Above all, the unlimited 
decomposition which prevails in the domain of all algebraic integers is 
never encountered in a domain o of the kind indicated, as one easily 
sees by consideration of norms. If we define the norm of any number 


{ Dedekind calls it “finite, of degree n”. (Translator’s note.) 
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pt = $(8) belonging to the field Q to be the product 
N(u) = bpape2..-Mn—1; 
whose factors are the conjugate numbers 
H=9(9), wi = (61), Ho=G(92), ---, Hn-1 = $(Pn-1) 


where 0,61, 02,...,9n—1 denote all the roots of the irreducible n** degree 
equation, then N() will always be, as we know, a rational number, and 
never 0 unless , = 0. At the same time, one always has 


N(af) = N(a)N(8) 


where a and # are any two numbers of the field 2. Now if y is an integer, 
and hence a number in 0, then the conjugate numbers j4;, }i2,.-.; tin—1 
will likewise be integers, and so N() will be a rational integer. 

This norm plays an extremely important role in the theory of numbers 
in the domain o. In fact, let any two numbers a, (@ in this domain 
be called congruent or incongruent relative to the third p, called the 
modulus, according as their difference +(a — ) is or is not divisible by 
p. Then we can, just as in the theory of rational or complex integers, 
partition all numbers of the system e into number classes, such that each 
class is the set of all numbers congruent to a given number (representing 
the class). And a deeper study shows us that the number of classes 
(with the exception of the case 4 = 0) is always finite, equal at most 
to the absolute value of N(). An immediate consequence of this result 
is that N() = £1 just in case yz is a unit. Now if a number in the 
system o is called decomposable when it is the product of two numbers 
of the system, neither of which is a unit, it evidently follows from the 
above that each decomposable number can always be represented as the 
product of a finite number of indecomposable factors. 

This result again corresponds completely with the law holding in the 
theory of rational or complex integers, namely that each composite num- 
ber is representable as the product of a finite number of prime factors. 
But at the same time this is the point where the analogy, observed un- 
til now with the old theory, is in danger of being irrevocably broken. 
In his researches on the domain of numbers belonging to the theory of 
circle division, hence corresponding to equations of the form 0” = 1, 
Kummer noted the existence of a phenomenon distinguishing the num- 
bers of that domain from those considered previously. They differ in a 
manner so complete and so essential as to leave little hope of preserving 
the simple laws that govern the old theory of numbers. In fact, whereas 
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each number in the domain of rational or complex integers decomposes 
uniquely into a product of primes, one discovers that, in the numerical 
domains considered by Kummer, a number may be representable in sev- 
eral entirely different ways as the product of indecomposable numbers. 
Or what amounts to the same thing, one discovers that the indecompos- 
able numbers lack the characteristic of genuine primes, inasmuch as a 
prime cannot divide a product of two or more factors without dividing 
at least one of the factors. 

But the more hopeless one feels about the prospects of later research 
on such numerical domains,{ the more one has to admire the steadfast 
efforts of Kummer, which were finally rewarded by a truly great and 
fruitful discovery. That geometer succeeded in resolving all the apparent 
irregularities in the laws.t By considering the indecomposable numbers 
which lack the characteristics of true primes to be products of ideal 
prime factors whose effect is only apparent when they are combined 
together, he obtained the surprising result that the laws of divisibility in 
the numerical domains studied by him were now in complete agreement 
with those that govern the domain of rational integers. Each number 
which is not a unit behaves consistently in all divisibility situations, 
whether as divisor or dividend, as a prime or as a product of prime 
factors, actual or ideal. Two ideal numbers, whether prime or composite, 
which yield two actual numbers when combined with the same ideal 
number are called equivalent, and all the ideal numbers equivalent to 
the same ideal number form a class of ideal numbers. The set of all 
actual numbers, considered as a special case of ideal numbers, forms the 
principal class. To each class there corresponds an infinite system of 
equivalent homogeneous forms, in n variables and of degree n, which 
are decomposable into n linear factors with algebraic coefficients. The 
number of these classes is finite, and Kummer succeeded in determining 
their number by extending the principles used by Dirichlet to determine 
the number of classes of binary quadratic forms. 

The great success of Kummer’s researches in the domain of circle di- 
vision allows us to suppose that the same laws hold in all numerical 


{ In the memoir: De numeris complexis qui radicibus unitatis et numeris integri 
realibus constant (Vrastislavize, 1844, §8), Kummer said “Maxime dolendum vide- 
tur, quod hc numerorum realium virtus, ut in factores primes dissolvi possint qui 
pro eodem numero semper iidem sint, non eadem est numerorum complexorum, 
quz si esset tota hzec doctrina, quz magnis adhuc difficultatibus laborat, facile 
absolvi et ad finem perduci posset”. (See the Introduction, Section 0.6.4, for an 
English translation of this passage. Translator’s note.) 

t Zur Theorie der complexen Zahlen (Crelle’s Journal, 35). 


Introduction 57 


domains o of the most general kind considered above. In my researches, 
the goal of which has been to arrive at a definitive answer to this ques- 
tion, I began by building on the theory of higher order congruences, 
since I had previously noticed that the latter theory allows Kummer’s 
researches to be shortened considerably. However, while this method 
leads to a point very close to my goal, I have not been able to surmount, 
by this route, certain exceptions to the laws holding in other cases. I did 
not achieve the general theory, without exceptions, that I first published 
in the place mentioned above, until I abandoned the old formal approach 
and replaced it by another; a fundamentally simpler conception focussed 
directly on the goal. In the latter approach I need no concept more novel 
than that of Kummer’s ideal numbers, and it is sufficient to consider a 
system of actual numbers that I call an ideal. The power of this concept 
resides in its extreme simplicity, and my plan being above all to inspire 
confidence in this notion, I shall try to explain the train of thought that 
led me to it. 

Kummer did not define ideal numbers themselves, but only the divis- 
ibility of these numbers. If a number a@ has a certain property A, to the 
effect that a satisfies one or more congruences, he says that a is divis- 
ible by an ideal number corresponding to the property A. While this 
introduction of new numbers is entirely legitimate, it is nevertheless to 
be feared at first that the language which speaks of ideal numbers being 
determined by their products, presumably in analogy with the theory of 
rational numbers, may lead to hasty conclusions and incomplete proofs. 
And in fact this danger is not always completely avoided. On the other 
hand, a precise definition covering all the ideal numbers that may be 
introduced in a particular numerical domain o, and at the same time a 
general definition of their multiplication, seems all the more necessary 
since the ideal numbers do not actually exist in the numerical domain o. 
To satisfy these demands it will be necessary and sufficient to establish 
once and for all the common characteristic of the properties A, B,C,... 
that serve to introduce the ideal numbers, and then to indicate how 
one can derive, from properties A, B corresponding to particular ideal 
numbers, the property C' corresponding to their product.{ 


{ The legitimacy, or rather the necessity, of such demands, which must always be 
imposed with the introduction or creation of new arithmetic elements, becomes 
more evident when compared with the introduction of real trrational numbers, 
which was the subject of a pamphlet of mine (Stetigkeit und irrationale Zahlen, 
Brunswick, 1872). Assuming that the arithmetic of rational numbers is soundly 
based, the question is how one should introduce the irrational numbers and define 
the operations of addition, subtraction, multiplication and division on them. My 
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This problem is essentially simplified by the following considerations. 
Since a characteristic property A serves to define, not an ideal number 
itself, but only the divisibility of the numbers in o by the ideal number, 
one is naturally led to consider the set a of all numbers a of the domain 
o which are divisible by a particular ideal number. I now call] such a 
system an ideal for short, so that for each particular ideal number there 
corresponds a particular ideal a. Now if, conversely, the property A of 
divisibility of a number a by an ideal number is equivalent to the mem- 
bership of a in the corresponding ideal a, one can consider, in place of 
the properties A, B,C,... defining the ideal numbers, the corresponding 
ideals a, b, c,..., in order to establish their common and exclusive char- 
acter. Bearing in mind that these ideal numbers are introduced with 
no other goal than restoring the laws of divisibility in the numerical 
domain o to complete conformity with the theory of rational numbers, 
it is evidently necessary that the numbers actually existing in o, and 
which are always present as factors of composite numbers, be regarded 


first demand is that arithmetic remain free from intermixture with extraneous 
elements, and for this reason I reject the definition of real number as the ratio 
of two quantities of the same kind. On the contrary, the definition or creation of 
irrational number ought to be based on phenomena one can already define clearly 
in the domain R of rational numbers. Secondly, one should demand that all real 
irrational numbers be engendered simultaneously by a common definition, and 
not successively as roots of equations, as logarithms, etc. Thirdly, the definition 
should be of a kind which also permits a perfectly clear definition of the calculations 
(addition, etc.) one needs to make on the new numbers. One achieves all of this 
in the following way, which I only sketch here: 

1. By a section of the domain R I mean any partition of the rational numbers 
into two categories such that each number of the first category is algebraically less 
than every number of the second category. 

2. Each particular rational number a engenders a particular section (or two 
sections, not essentially different) in which each rational number is in the first or 
second category according as it is smaller or larger than a (while a itself can be 
assigned at will to either category). 

3. There are infinitely many sections which cannot be engendered by rational 
numbers in the manner just described. For each section of this kind one creates 
or introduces into arithmetic a special irrational number, corresponding to the 
section (or engendered by it). 

4. If a, @ are any two real numbers (rational or irrational), then one easily 
defines a > G or a < @ in terms of the sections they engender. Moreover, one can 
easily define, in terms of these two sections, the four sections corresponding to the 
sum, difference, product and quotient of the two numbers a, @. In this way the 
four fundamental arithmetic operations are defined without any obscurity for an 
arbitrary pair of real numbers, and one can really prove propositions such as, for 
example, /2-/3 = V6, which had not previously been done, as far as I know, in 
the strict sense of the word. 

5. When defined in this way, the irrational numbers unite with the rational 
numbers to form a domain ® without gaps and continuous. Each section of this 
domain ® is produced by a particular number of the domain itself; it is impossible 
to engender new numbers in this domain ®. 
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as a special case of ideal numbers. Thus if y is a particular number 
of 0, the system a of all numbers a = pw in the domain o divisible by 
ut likewise has the essential character of an ideal, and it will be called 
a principal ideal. The latter system is evidently not altered when one 
replaces 4 by ey, where € is any unit in 0. Now, the notion of inte- 
ger established above immediately yields the following two elementary 
theorems on divisibility: 

1. If two integers a = pw, a’ = jw" are divisible by the integer py, then 
so are their sum a+a’ = p(w+u)’) and their difference a—a’ = u(w—w"), 
since the sum w + w’ and difference w — w' of two integers w, w’ are 
themselves integers. 

2. If @ = pw is divisible by , each number aw’ = p(ww") divisible by 
a will also be divisible by ys, since each product ww’ of integers w, w’ is 
itself an integer. 

If we apply these theorems, true for all integers, to the numbers w 
of our numerical domain 0, with yp denoting a particular one of these 
numbers and a the corresponding principal ideal, we obtain the following 
two fundamental properties of such a numerical system a: 

I. The sum and difference of any two numbers in the system a are 
always numbers in the same system a. 

II. Any product of a number in the system a by a number of the system 
o is a number in the system a. 

Now, as we pursue the goal of restoring the laws of divisibility in 
the domain o to complete conformity with those ruling the domain of 
rational integers, by introducing ideal numbers and a corresponding lan- 
guage, it is apparent that the definitions of these ideal numbers and their 
divisibility should be stated in such a way that the elementary theorems 
1 and 2 above remain valid not only when the number y is actual, but 
also when it is ideal. Consequently, the properties I and II should hold 
not only for principal ideals, but for all ideals. We have therefore found 
a common characteristic of all ideals: to each actual or ideal number 
there corresponds a unique ideal a, enjoying the properties I and II. 

A fact of the highest importance, which I was able to prove rigorously 
only after numerous vain attempts, and after surmounting the greatest 
difficulties, is that, conversely, each system enjoying properties I and II 
is also an ideal. That is, it is the set a of all numbers a of the domain 
o divisible by a particular number; either an actual number or an ideal 
number indispensable for the completion of the theory. Properties I and 
II are therefore not just necessary, but also sufficient conditions for a 
numerical system a to be an ideal. Any other condition imposed on the 
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numerical system a, if it is not simply a consequence of properties I and 
II, makes a complete explanation of all the phenomena of divisibility in 
the domain o impossible. 

This finding naturally led me to base the theory of numbers in the 
domain o on this simple definition, entirely free from any obscurity and 
from the admission of ideal numbers.} 

Each system a of integers in a field 2, possessing properties I and II, 
is called an IDEAL OF THAT FIELD. 

Divisibility of a number a by a number yz means that a is a number 
pw in the principal ideal corresponding to the number yu and which can 
be conveniently denoted by o(j) or ou. At the same time it follows from 
property II or theorem 2 that all the numbers in the principal ideal oa 
are also numbers in the principal ideal oy. Conversely, it is evident that 
a is certainly divisible by 4 when all numbers in the ideal oa, and hence 
a itself, are in the ideal oy. This leads us to establish the following 
notion of divisibility, not just for principal ideals, but for all ideals: 

An ideal a is said to be divisible by an ideal b, or a multiple of b, and 
6b is said to be a divisor of a, when all numbers in the ideal a are also in 
b. An ideal p, different from 0, which has no divisors other than 0 and 
p, is called a prime ideal.t 

Divisibility of ideals, which evidently includes that of numbers, must 
at first be distinguished from the following notion of multiplication and 
the product of two ideals: 

If a runs through all the numbers in an ideal a, and 8 runs through 
all the numbers in an ideal b, then all the products of the form a8, and 
all the sums of these products, form an ideal called the product of the 
ideals a, b, which we denote by ab.§ 

One sees immediately, it is true, that the product ab is divisible by a 
and b, but establishing the complete connection between the notions of 
divisibility and multiplication of ideals succeeds only after we have van- 
quished the deep difficulties characteristic of the nature of the subject. 
This connection is essentially expressed by the following two theorems: 

If the ideal ¢ is divisible by the ideal a, then there is a unique ideal b 
such that ab = c. 


It is of course permissible, though not at all necessary, to let each ideal a correspond 
to an ideal number which engenders it, if a is not a principal ideal. 

{ Likewise the ideal number corresponding to the ideal ab is said to be divisible by 
the ideal number corresponding to the ideal 6, and corresponding to a prime ideal 
one has a prime ideal number. 

§ The ideal number corresponding to the ideal ab is called the product of the two 
ideal numbers corresponding to a and b. 
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Each ideal different from o is either a prime ideal, or uniquely ez- 
pressible as a product of prime ideals. 

In the present memoir I confine myself to proving these results in a 
completely rigorous and synthetic way. This provides a proper founda- 
tion for the whole theory of ideals and decomposable forms, which offers 
to mathematicians an inexhaustible field of research. Of all the later de- 
velopments, for which I refer to the exposition I have given in Dirichlet’s 
Vorlesungen tiber Zahlentheorie and other memoirs still to appear, I have 
included here only the partition of ideals into classes, and the proof that 
the number of classes of ideals (or of classes of the corresponding forms) 
is finite. The first section contains only the propositions necessary for 
the present goal, extracted from an auxiliary theory, also important for 
other researches, which I shall expound fully elsewhere. The second sec- 
tion, which aims to clarify the general notions by numerical examples, 
can be omitted entirely. However, I have kept it because it may help in 
the understanding of the later sections, where the theory of integers in 
an arbitrary field of finite degree is developed from the above viewpoint. 
To do this one needs to borrow just the elements of the general theory of 
fields, a theory whose further development leads easily to the algebraic 
principles invented by Galois, which in their turn serve as a basis for 
deeper researches into the theory of ideals. 


1 


Auxiliary theorems from the theory of 
modules 


As I have emphasised in the Introduction, we shall frequently have to 
consider systems of numbers closed under addition and subtraction. The 
general properties of such systems form a theory so extensive that it can 
also be used in other researches; nevertheless, for our purposes just the 
elements of this theory are sufficient. In order to avoid later interruption 
to the course of our exposition, and at the same time to make it easier to 
understand the scope of the concepts on which our theory of algebraic 
numbers is based, it seems appropriate to begin with a small number 
of very simple theorems, even though their interest lies mainly in their 
applications. 


§1. Modules and their divisibility 


1. A system a of real or complex numbers will be called a module when 
all the sums and differences of these numbers also belong to a. 
Thus if @ is a particular number in the module a, all the numbers 


a+a=2a, 2a+a=3a, ..., 


a-a=0, 0-a=-a, -a-a=-—2a, ..., 


and consequently all numbers of the form za also belong to a, where x 
runs through all the rational integers, that is, all the numbers 


0, +1, +2,+3,.... 


Such a system of numbers ra: itself forms a module, which we denote 
by [a]. Consequently, if a module includes a nonzero number then it 
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includes an infinity of different numbers. It is also evident that the 
number zero, which is in each module, forms a module by itself. 


2. A module a will be called divisible by the module b or a multiple of 
b, and 6 a divisor of a, when all the numbers in the module a are also 
in the module b. 

The zero module is therefore a common multiple of all modules. Also, 
if a is a particular number in a module a, then the module [a] will be di- 
visible by a. Moreover, it is evident that any module is divisible by itself, 
and that two modules a, 6b divisible by each other are identical, which 
we shall denote by a = b. Finally, if each of the modules a,b,c,0,... 
is divisible by its immediate successor then it is clear that each will be 
divisible by all its successors. 


3. Let a, 6 be any two modules. The system m of all the numbers 
that belong to both modules will itself be a module. It will be called the 
least common multiple of a, 6 because each common multiple of a, 6 is 
divisible by m. 

Indeed, let 4, u’ be any two numbers in the system m, and hence in 
both a and 6b. Each of the two numbers p + p’ will belong (by 1) not 
only to the module a but also to the module b, and hence also to the 
system m, whence it follows that m is a module. Since all members of 
this module m are in a and also in b, m is a common multiple of a, b. 
Moreover, if the module m’ is any common multiple of a, 6, and thus 
composed entirely of numbers belonging to both a and b, then (by virtue 
of the definition of the system m) these numbers will also be in m, that 
is, m’ is divisible by m. 


4. If a becomes equal in succession to all the numbers in a module a, 
and £ to all the numbers in a module b, then the system 0 of all numbers 
a+ £ will form a module. This module is called the greatest common 
divisor of a, 6 because every common divisor of a, 6 is also a divisor of 
0. 

Indeed, any two numbers 6, 6’ in the system 0 can be put in the form 
6=a+, & =a! + B' where a, a’ belong to the module a and 8, 9’ to 
the module b, whence 


646 =(ata’')+(B+P); 


and, since the numbers a +a’ are in a and the numbers (7 + 2’ are in 6, 
the numbers 6 + 6’ also belong to the system 0. That is, 0 is a module. 
Since the number zero is in every module, all the numbers a = a + 0 
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of the module a and all the numbers 6 = 0+ £ of the module b belong 
to the module 0. Consequently, the latter is a common divisor of a and 
b. Also, if the module 0’ is any common divisor of a, 6, so that all the 
numbers in a and all the numbers in b are in 0’ then (by virtue of 1) all 
the numbers a+ (, that is, all the numbers in the module 0, also belong 
to the module 0’. Thus 2 is divisible by 0’. 


Having carried out these rigorous proofs, we need not explain further 
how the notions of least common multiple and greatest common divisor 
can be extended to any number (even an infinity) of modules. Neverthe- 
less, it may be useful to justify the terminology chosen, by the following 
remark. If a, b are two particular rational integers, m their least com- 
mon multiple and d their greatest common divisor, it follows from the 
elements of number theory that [m] will be the least common multiple, 
and [d] the greatest common divisor, of the modules [a], [b]. In any case 
we shall soon see that the number-theoretic propositions relevant to this 
case can also be deduced from the theory of modules. 


§2. Congruences and classes of numbers 


1. Let a be a module. Numbers w, w” will be called congruent or 
incongruent modulo a according as their difference +(w — w’) is in a 
or not. Congruence of the numbers w, w’ with respect to the module a 
will be indicated by the notation 


w =w)’ (mod a). 


We immediately deduce the following simple propositions, whose proofs 
we can omit: 

If w =’ (mod a) and u’ = w” (mod a) then w = w” (mod a). 

If w =w’ (mod a) and x is any rational integer then rw = xw’ (mod a). 

If w = w’ (mod a) and w” = w” (mod a) then w tw” = uw! tw" 
(mod a). 

If w =" (mod a) and 2 is a divisor of a, then w = w’ (mod 0). 

If w = wu’ (mod a) and w = w’ (mod b) then w = w’ (mod m), where 
m is the least common multiple of a, 6. 


2. The first of the preceding theorems leads to the notion of a class 
of numbers relative to a module a, by which we mean the set of those 
numbers congruent to a particular number, and hence to each other, 
modulo a. Such a class modulo a is completely determined by giving a 
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single member, and each member can be regarded as a representative of 
the whole class. The numbers in the module a, for example, form such 
a class, represented by the number zero. 

Now if 6 is a second module we can always choose in 6 a finite or 
infinite number of numbers 


(6,) A, Pa, Bs, ecary 


in such a way that each number in 6 is congruent modulo a to one of these 
numbers, and to only one. Such a system of numbers (, in the module 
6, which are mutually incongruent modulo a, but which represent all 
classes having members in b, I call a complete system of representatives 
of the module 6 modulo the module a, and the number of numbers f3,, or 
of the classes they represent, will be denoted by (6, a), when it is finite. 
If, on the contrary, the number of representatives G, is infinite, it will 
be convenient to assign the value zero to the symbol (6, a). A deeper 
examination of such a system (§,) of representatives now leads to the 
following theorem: 


3. Let a, 6 be any two modules, with least common multiple m and 
greatest common divisor 0. Any complete system of representatives of 
the module 6 modulo a will at the same time be a complete system of 
representatives of the module b modulo m, and for the module 0 modulo 
a; consequently we have 


(6, a) = (6, m) = (0,a). 


First of all, it is evident that any numbers £, 3’ in the module 6 which 
are congruent modulo a are congruent modulo m, because ( — ’ is in 
a as well as in b, and hence also in m. Now, since each number ( in 
the module 6 is congruent to one of the representatives £, modulo a, 
and hence also modulo m, and since any two different representatives 
are incongruent modulo a and hence also modulo m; these numbers (3, 
in b form a complete system of representatives of the module 6 modulo 
m. The second part is proved in absolutely the same way: since 6 is 
divisible by 0, the numbers 3, are likewise in 0 and, by hypothesis, they 
are incongruent modulo a. And, since each number 6 in 0 is of the form 
a+ 3, where a is in a and £ is in b, we have 


6=a+ = 8 (mod a), 


and, since 6 and consequently 6 is congruent to one of the numbers £, 
modulo a, the numbers (3, form a complete system of representatives for 
the module 0 modulo a. Q.E.D. 
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If 6 is divisible by a then (6,a) = 1, because all numbers in 6 are = 0 
(mod a). Conversely, if (b,a) = 1 then 6 will be divisible by a, since all 
numbers in b are congruent to each other and hence = 0 (mod a). We 
evidently have at the same time m = b, 0 = a. 


4. If 0 is a divisor of a and at the same time a multiple of ¢, moreover 
if @, runs through the representatives of b modulo a and if y, runs 
through the representatives of c modulo b, then the numbers 3, + ‘5 
form a complete system of representatives of the module ¢ modulo a, 
and consequently 


(c,a) = (c, b)(b, a). 


In the first place, all the numbers 3, +-y, belong to the module c, since 
G, is in b and hence also in c, and 7, is likewise in c. 

In the second place, they are all incongruent modulo a. In fact if we 
let 6’, B” be particular values of G,, and let 7’, 7” be particular values 
of y,, then the hypothesis @’ + 7/ = 6” + 7" (mod a) implies, since a is 
divisible by 6 and (’ = 3” = 0 (mod B), that y’ = 7” (mod 6). However, 
since 7’, 7’ are particular members of the series ,, any two of which are 
incongruent modulo 6, we must have +’ = +” and hence the hypothesis 
above becomes (3’ = 3” (mod a). Now since /’, 3” likewise are particular 
members of the series G,, any two of which are incongruent modulo a, 
we must have @’ = 3”, which proves the assertion above. 

In the third place, it remains to see that each number 7¥ in ¢ is con- 
gruent to one of the numbers (, + y,; modulo a. In fact, since each 
number 7 is congruent to one of the numbers y, modulo 6, we can 
choose y = 8+ 7s, where { is a number in the module b. Also, since 
each of these numbers ( is congruent to one of the numbers (, modulo 
a, we can choose 3 = a+ (,, where a is a number in the module a. We 
then have 


Y=B+%s =A+ Brt+ 7s = Br +s (mod a). 
QED. 


5. Let m be the least common multiple, and 0 the greatest common 
divisor, of two modules a, 6, and let p, o be given numbers. The system 
of two congruences 


w =p (mod a), w =o (mod b) 


has a common root if and only if 
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p=co (mod d), 
and all such numbers w form a class of numbers modulo the module m. 


If there is a number w satisfying the two congruences then the numbers 
w — p, w —o will be in a, b respectively, and since the latter are both 
in 0, the difference p — o of the numbers will likewise be in 0. That 
is, the above condition p = o (mod 0) is necessary. Conversely, if this 
condition is satisfied then (by virtue of the definition of 0 in §1,4) there 
is a number @ in a and a number ( in 6 whose sum a+ (7 = p—o, hence 
the number w = p - a =a + £ satisfies the two congruences. Thus the 
condition is also sufficient. Moreover, if w’ is a number satisfying the 
same conditions as w, then w’ —w will also be in both a and b, and hence 
also in m, which means that w’ = w (mod m). Conversely, each number 
w' in the class represented by w modulo m will satisfy the congruences. 
Q.E.D. 


§3. Finitely generated modules 
1. Let (1, Bo, 83,-.., 8, be particular numbers. All the numbers 


B= yi hi + yobe + y383 +--+ + YnBn; 


where y:, ya, Y3,---,Yn are arbitrary rational integers, evidently form a 
module, which we call a finitely generated} module [(1, Geo, 83,---, Bnl- 
The complex of constants 31, G2, G3,..., Gy will be called a basist of the 
module. 

This module [(1, 62,...,8n] is evidently the greatest common divi- 
sor of the n finitely generated modules [4], [@2],..-, [Bn]. It is easy to 
see that each multiple of a finitely generated module is itself a finitely 
generated module, but here I confine myself to proving the following 
fundamental theorem, which will later have important applications. 


2. If all the numbers in a finitely generated module 6b = [(;, (2,..., Bn] 
can be transformed into the members of a module a by multiplication 
by nonzero rational numbers, then the least common multiple m of a 


{ Dedekind calls them “finite”. (Translator’s note.) 
{ Note that Dedekind’s basis elements need not be independent. However, he re- 
quires them to be independent for fields (§15). (Translator’s note.) 
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and 6 will be a finitely generated module, and one can choose a system 
of $(n + 1)n rational integers a such that the n numbers 


M1 = a} fr, 
be = aff, + 4382, 
pin = a” By + af” Bo + af” Bg +--+ +a Bn, 
form a basis of m, and at the same time 
(6, a) = (6, m) = aj avay!.--al”), 
By hypothesis there are n nonzero fractions 
81 82 83 Sn 
ty’ te’ t3’ nee t2 
whose numerators and denominators are rational integers, such that the 


n products 


$3 Sn 


sy 82 

ty Ai, ty Ba, ts Bs, vee th Bn 
belong to the module a. Since members of a module a are changed into 
other members of a when multiplied by rational integers tj, te, t3,...,tn 


(§1,1), the products 1/1, 2/2, $33, .-., 8nBn likewise belong to a, and 
if s denotes the absolute value of the product s)$283---$,, the numbers 
831, $32, $83,...,$8n, and consequently all products sf, belong to the 
module a, where @ denotes any number in the module b. 

Now let v be a particular index from the sequence 1,2,...,n. Among 
the numbers in the module [(, G2, ..., G,] divisible by 6 let 

By = Br + yoo + +++ + wR 
denote those belonging to module a and hence also to module 6, for 
example s3,. Among these numbers yi, there will be at least one number 
fy, = af, + al Bp +--+ a\”)B,, 


for which y takes the smallest positive value al”. One can then see 
that, in all the numbers p/,, the coefficient y is divisible by at”) . This is 
because one can always put 


Ww zyal) + yy 
where x, and yj, are rational integers and the latter satisfies the conditiont 
0<y, <a. 


{ Which is the foundation of the theory of division for rational integers. 
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Then if we put 
wanna, h=n-na?, ., xa=wi-nal, 
the number 
By — Sv bv = yy Br + ygBe + +++ + Yp-1Bv—-1 + YB 


belongs to both the module [(;, 62,...,G,] and the module m, since pj, 
and p, are in m. But since (by definition of the ,) the coefficient of 
Gy is less than al” and at the same time positive, it is necessary that 
y!, = 0, and hence that y, = z,a!” be divisible by a{”, as required. At 
the same time 


My ~ Duby = By_1 


becomes a number in [(;, G2,..., 8-1] and also in m, or else becomes 
zero in the case v = 1. 

It follows easily that the n numbers p,, obtained by putting v = 
n,n—1,...,2,1 in succession, enjoy the properties enunciated in the 
theorem. Each number yp in the module m, that is, each number p/, in 
both a and 6 = [(1, f2,-..., Bn], is of the form 


= Py-1 + Lnbln 


where zp, is a rational integer and y/,_, is a number belonging to the 
two modules a and [(, 62,..., Bp—1], and hence also to the module m. 
Each number p/,_, of this nature is of the form 


, , 
Un—1 = Ung + Tn-1bn-1,; 


where r,-1 is a rational integer and yj,» is a number belonging to the 
two modules a and [/1, G2,...,;8n—1], and so on. Finally, each number 
1, belonging to the two modules a and [(;] is of the form 


My = Trp 


where 2, is a rational integer. Thus it is proved that each number yp of 
the module m can be represented in the form 


B= 2p + Lope +++ + Fnbln, 


where 21, 2%2,...,2n are rational integers. Conversely, since an arbitrar- 
ily chosen system of rational integers x1, 22,...,2£n, certainly produces a 
number yp in the module m, because ji, f2,.-., fn themselves belong to 


m, the n numbers 111, [/2,---, n form a basis of the module m. 
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To prove the last part of the theorem we have to consider all the 


numbers 
6 = 4Pr + Ato + Bn 
of the module 6 for which the rational integers z}, z5,..., 2}, satisfy the 
n conditions 
0<2z <a. 
We shall show that these numbers (’, the number of which is evidently 
equal to ajag --- a), form a complete system of representatives of the 


module 6 modulo m (or a). 
In the first place, all the numbers 3’ in the module b are incongruent 


modulo m. If 


281 +++ + 2,Bn = 218i +--+ + 2Bn (mod a), 


then the numbers 21’, z4,...,2z/. satisfy the same n conditions as the 
numbers 2}, 23,...,2),. Then if the n differences 
/ ” U n" / nN , u" 
en — Xn) 2n—-1 ~ ®n-1) eeey 2g — 22, 24 2 


are not all zero, let zj, — z’ be the first of them with a nonzero value, 
a value which we can assume to be positive by symmetry, and which is 
also < a”) since both numbers zj, and zi) are < a”), Then the difference 


(zy — 27) Bi +++ + (2%, — 2) 
is evidently a number pj, in a and [(), G2,..., Bn] for which the coefficient 
of §, is positive and < al! , contrary to the definition of the number 
fy. Thus any two different systems of n numbers 2}, 25,...,2),, which 
satisfy the conditions above, also produce two numbers {3 in the module 


b which are incongruent modulo a. 
In the second place, it is easy to see that an arbitrary number 


B= 2A + 2282 +---+2nBn 
in the module b is congruent modulo a (or m) to one of the numbers 
8" since, if z1,22,...,2n are given, it is clear that we can successively 
choose n rational integers 
Ln,In—-1,---,X2,2%1 
so that the n numbers 


i 
Zn = ent az, 


, (n) (n-1) 
Zy—1 = 2n-1 + Any Fn + An_y “Ln~1, 
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(n) (n—1) 


2 = 2 +3 In +05 ’Fn-1+-++ +4522, 


a=at a” ry + af? ge +:++»+alr2+a\21, 
satisfy the n conditions 0 < 2, < al). If we now put 
B= 281 + 2382 +++ + 2,Bns 


we have 


3 = B+ rip + Tope +--+ + 2npn, 


and hence 3 = {’ (mod m). Q.E.D. 


§4. Irreducible systems 


1. A system of n numbers a},Q2,...,Qn will be called an irreducible 
system, and its members will be called independent, when the sum 


Q@= 2101+ £202 +-'++FnAan 


is nonzero for any system of rational numbers 21, 2%2,...,2n which are 
not all zero. It then follows that any two different systems of ratio- 
nal numbers 21, 22,...,2n produce unequal sums a. In the contrary 
case, that is, when there is a system of rational numbers 21, 2%2,...,2n,; 
not all zero, for which the sum a is zero, then the system of numbers 
Q1,Q2,..-,Q@n, will be called reducible, and the numbers themselves will 
be called dependent on each other. If one wants to retain this terminol- 
ogy in the case n = 1, a single number evidently forms a reducible or 
irreducible system according as it is zero or not. The definition above 
easily yields the following theorems, whose number can be increased 
enormously, on the determinants of rational numbers. 


2. If the n numbers aj, Q2,...,@n, are independent, then the n num- 
bers 


/ / , 
Oy = C,Q1 + C902 +++-+C,0n, 
, Jt i i 
Gy = C1Q1 +CoQ2 +--+, an, 


af, = ay + ay +--+ Man, 
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whose n? coefficients c are rational numbers, form an irreducible or re- 
ducible system according as the determinant 


CaS tei cf- +c) 


is nonzero or not. 


Since a1, 02,...,Qn are independent, the sum 
, ! PF 
21Q, + 2X2Q_ +-'-+2%na, =a’, 
where £1, 22,..-,@y are arbitrary rational numbers, not all zero, cannot 
vanish unless we simultaneously have 


C21 + C12 i a tn = 0, 
Colq + CoL2 +++ +05 Fn =D, 
ety + cag +--+ +c az, = 0, 


which is impossible when C’is nonzero. Hence the numbers a4, a4,..., a, 


are independent in that case. But if we have C = 0 there is always a 
system of rational numbers 2x1, %2,..., pn satisfying the preceding equa- 
tions, and not all zero. This is seen immediately when all the n? coeffi- 
cients c vanish. If this is not the case then, among the minor determi- 
nants of C' that do not vanish there will be one, say 


> +¢)¢3 es of), 
of mazimal rank r < n such that the minor determinants of higher 
degree vanish. In this case, as we know, the last n — r of the equations 


above will be consequences of the preceding r, and we can put these r 
equations in the form 


, ’ 
21 = Pp4itr+1 test + Pptn, 


tr = pO tr41 a he +p zn, 


where the r(n—r) coefficients p are rational numbers. Now if we give the 
n—r quantities 7,41,...,£, arbitrary rational values then not only can 
we ensure that they are not all zero, the quantities 71,..., 2, will likewise 
take rational values. Thus we have a system of n rational numbers 
%1,@2,...,2n, not all zero, for which the sum a’ is zero. Hence in this 
case the n numbers a4, @4,...,a/, are dependent. Q.E.D. 
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3. If the n independent numbers a), Q2,...,@n, form one basis of a 
module a, and a}, a,...,a/, form another, then we have 


af = Ma, + Mar +---+ Man, 


where the n? coefficients c are rational integers whose determinant is 
+1, and consequently the numbers a}, a4,...,a/, are also independent. 

In fact, since the numbers a/, are in the module a = [a1,a2,..., Qn] 
there are in any case n equations of the preceding form, in which the 
coefficients c are rational integers. Conversely, since the n numbers a, 
are in the module a = [a{,a5,...,a/,] there are also n equations of the 
form 

ay =e aot +eMah t--- +eMal,, 

whose coefficients e are likewise rational integers. Substituting in them 
the first n equations for the n numbers a/,, and bearing in mind that 
the n numbers qa, form an irreducible system, it follows that the sum 


Me 4 pte cl jv pee fell”) =lor0 


according as the indices v, v’ are equal or not. Then the product of the 


determinants 
So tehcg:- of). So tejes: fee el™) = 1, 


and, since each factor is a rational integer, 


do tee an, ef) =>) tele3 Un el?) = +1. 


Q.E.D. 


Conversely, it is clear that [a{,a5,...,a/,] = [a1,Q@2,...,Qn] when 
there are n equations of the form 
of = May +--+ Man, 
where the coefficients c are rational integers whose determinant = +1. 


4. If the n independent numbers (3), ..., 8, form the basis of a module 
b, and if n numbers aj,..., Qn, forming the basis of a module a, depend 
on them via n equations of the form 
ay = oa, i a bY) Bn, 
where the coefficients 6 are rational integers whose determinant B is 


nonzero, then the number of classes is 


(6, a) ia 
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Now each of the numbers f3;,..., 8,, and hence each number {, of the 
module [(1, G2,..., Gn] can be changed into a member of the module a 
by multiplying by a nonzero rational number B. It follows, since a is 
divisible by b and hence also equal to the least common multiple of a 
and 6, that a has (by §4,3) a basis of n numbers of the form 


a, =a 6, +a B+... taB,, 
whose coefficients a are rational integers chosen so that 
(b,a) = ajaf---a™ = S> +a}.a4 al), 


Moreover, since the n numbers a1,...,Q@n likewise form a basis of the 
module a and since (by 2) each of these two systems of n numbers is 
irreducible, because we assume this of the system (1,..., Gn, we then 
have (by 3) n equations of the form 


at, = Ma, feet ean, 
with rational integer coefficients c whose determinant 
>) teig ol) = 41. 


By replacing the numbers a},...,@,, by their expressions above in terms 
of the n independent numbers (1, ..., 3, we see, by comparison with the 
preceding expressions for the numbers a/, in terms of the same numbers 


Ay, Las »Bns that 
a) =a of) Bh + bl, fete oo, 


and consequently 


Sta} Qi = So te} oe). S540} + BM), 
Thus we have (6, a) = +B. Q.E.D. 


This important theorem can easily be extended (and even more simply 
by means of the theorem below) to the more general case where the 
coefficients b are fractional rational numbers. One then obtains the 


theorem 

(b,a) = +B(a,b), 
and each of the two numbers of classes, (a,b) and (b, a), can be deter- 
mined by a simple rule involving the determinant B and all its minor 
determinants. 


5. If only n among the m numbers a1, @2,...,Q@m forming a basis 
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of the module a are independent, then a has a basis consisting of n 


independent numbers a4, 04,...,a/,. 


The hypothesis of this theorem will evidently be satisfied whenever 
all the m numbers aQ,...,@m are expressible in terms of n independent 
numbers w1,...,Wn, as 


(4) 


Oy = rit (+) 


wy +1; we te +rMYun, 


where the system of coefficients 


/ / / 
1» 5, tig Tey 
La a iM 
TY], a5 wiage Tas 
(r) 
sv tee iby 98 
(m) —,(m) (m) 
Ti 5 T3209 ery Tr 


consists of rational numbers, at least one of whose 


nxn partial determinants R is nonzero. Otherwise, any n of the m num- 
bers a, would be dependent. Conversely, it follows from the hypothesis 
of the theorem, that the m numbers a, can always be expressed in terms 
of n independent numbers w,, by choosing the latter to be, for example, 
n numbers among the m numbers a, which form an irreducible system. 
Then, since the n + 1 numbers a@,,W1,...,Wn are dependent there is an 
equation, for each index yp, of the form 


ToMy, + LyW, + LoWe +++: + Tnw, = 0, 


with rational coefficients x not all zero. Moreover, since w1,w2,...,Wn 
are independent, rp must be nonzero, and consequently a, can be rep- 
resented, in the manner indicated, in terms of the numbers w,. Finally, 
since the m numbers a, include the n numbers w,, at least one of the 
determinants R will be nonzero. 

I shall therefore assume that the m numbers a, are represented, in 
the manner indicated, in terms of the n independent numbers w,, and 
I shall show that, no matter how the numbers w,, are chosen, there is a 
basis of the module a = [ai, @2,...,@m] consisting of n numbers a’, of 
the form 


al, = Muy + Pu. +: ~+eMw,, 


with rational coefficients c. To do this I remark first that we can evi- 
dently choose a positive integer k so that the mn products kr ) are 
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integers. If we now put 
Wy = kfi, We = kBo, seey Wn = kBn; 


and express the numbers a, in terms of the numbers , it follows 
that the module a = [a1,Q2,---,Qm] is divisible by the module 6 = 
[B1, B2,---, Bn], and hence it is the least common multiple of a, b. More- 
over, since the n numbers 3, become the n numbers w, when multiplied 
by & and the latter, when multiplied by a nonzero determinant R, be- 
come numbers of the form 


2101 + 22Q2+°:'+IlmAm, 


with rational coefficients z, it is clear that each number ( in the module 
b, when multiplied by a nonzero rational, itself becomes a number in the 
module a. It follows from this (by §3,2) that the least common multiple 
of the two modules a, 6 has a basis consisting of n numbers of the form 


a= al”) By + ol”) Bo ++ +al3,, 


with rational coefficients a for which the product a{a¥ --- a”) is nonzero. 
If we now re-express the numbers (,, in terms of the n numbers w, we 
can conclude that the assertion above is true, which at the same time 
proves the theorem. 

6. To the preceding proof I add the following remarks. Since the m 
numbers q, form a basis for the module a just as much as the n numbers 
a!,, there are m equations of the form 


ay, = pot, + pal, +--+ pHa, 
and n equations of the form 
a, = gar + qaz+--- +a!” am, 


where the 2mn coefficients p and q are all rational integers. By substi- 
tuting the first expressions in the second, and bearing in mind that the 
n numbers aj, are independent, we deduce that the sum 


dp + ip, +--+ gp =1 or 0, 


according as the m indices v, v’ from the series 1,2,...,n are equal 
or not. Then if P denotes any n x n partial determinant formed from 
the system of coefficients (p), and if Q denotes any determinant formed 
similarly from the system of coefficients (q), then we know that the sum 


S> PQ, 
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taken over all combinations of n different upper indices, is equal to 1, and 
consequently the determinants P have no common divisor. Conversely, 
this property of the determinants P is necessary if the n numbers ai, 
and also the m numbers 


ay = pai +--+ pal, 
are to form a basis of the module a. 


A system of coefficients such as (p) is evidently just a special case of 
the preceding coefficients (r). Now since the n numbers af, can likewise 
be represented in the form 


al, = Mun + we +--- +eM wn, 


with n? rational coefficients e whose determinant 
E= >> tele3 vel”) 
is nonzero, we get 
rH) = pel + pel +--+ pel. 


Consequently, the two determinants R, P corresponding to systems of 
coefficients (r), (p) satisfy the relation 


R=PE. 


The problem of finding all the systems (p) corresponding to a given 
system (r) can be solved in the most comprehensive and elegant manner 
by generalising a method applied by Gaussf in the special case in which 
one utilises identities between the partial determinants. However, this 
would lead us too far away from our present position, and I am content to 
have shown the existence of a system such as (p). One sees immediately 
(from 3) that one can derive from it all other systems (p) by composition 
with all possible systems of n? rational integers with determinant +1. 
In practice, that is, in any case where the coefficients r are given 
numerically and, without loss of generality, as integers, we arrive most 
promptly at the goal by a chain of elementary transformations. These 
are based on the evident proposition that a module [a1, a2,...,@m] is 
not altered when we replace a1, for example, by the number aq) + rap, 
where z is any rational integer. The partial determinants R° correspond- 
ing to all combinations of n numbers from the new basis 
af =a, + 202, a} = ar, a} = 03, Mts a, =Am, 


t Disquisitiones Arithmeticae, art. 234, 236, 279. 
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and to the new system of coefficients (r°), coincide in part with the 
determinants R corresponding to the old basis 


a, =a°2—za®, a2=af, azg=a’, ..., am =al. 
They are of the form R? = R; + zRe2, from which we easily deduce that 
the greatest common divisor EF of the determinants R is the same as 
that of the determinants R°. Thus the determinants R° cannot simul- 
taneously vanish. We shall now use these basis transformations of the 
module a as follows: 

The m coefficients rt ) of the number Wry, cannot all be zero, since 
then all the determinants R would be zero. Now if two of these co- 
efficients, say r/, and rj’, are nonzero, and if |r/,| > |r|, then we can 
choose a rational integer x such that |r/, + ar!"| < |r!|.t The elementary 
transformation above therefore gives us a new basis in which all the m 
coefficients rit ) except the first, r/,, remain the same, and this single 
coefficient is replaced by one of smaller absolute value. By repetition of 
this procedure we necessarily arrive at a basis in which all but one of 
the m coefficients of w, are zero. We denote the member of the basis 
for which the latter coefficient a” is nonzero by 


af, = alu, + af wy +++ +a Wn, 
and we keep it fixed in all subsequent transformations of the basis. The 
partial determinants corresponding to the actual basis either vanish, or 
are of the form Sa”), where S is an (n—-1) x (n—1) partial determinant 
corresponding to an arbitrary combination of n — 1 of the m — 1 mem- 
bers of the basis other than a/,, and which is formed from the (n — 1)? 
coefficients corresponding to w1,we,...,Wn—1. Since the determinants S' 
cannot all be zero, we now proceed with these m—1 members of the basis, 
treating w,_1 as we previously did w,, in working with the m numbers 
a, of the original basis. If we continue these transformations then we 
finally obtain a basis of a consisting of n numbers a4,a05,...,a/,_1,a/, 
of the form 

a, = aw, + ay we +--+aM wy, 
and m—n = s numbers af, a3,...,a% which are all zero, and which can 
therefore be omitted. The n nonzero coefficients a”? can be taken to 
be positive, since a/, can be replaced by —a/, without alteration of the 


t Here again is the same principle which is fundamental in the theory of rational 
integers. 
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module, and their product aja¥ - - a j is evidently the greatest common 
divisor FE of all the partial determinants R. 

In this way we obtain a second proof of the important theorem (5), 
and at the same time it is evident that, by composition of the successive 
transformations and their inversion, we can find the system of coefficients 
(p) as well as a system of coefficients (q). In fact, one first obtains m 


equations of the form 
= Dela, + Daa’ 
Vv Co 
or, since the s numbers af are zero, 
> pal, 


and since the determinant of each of the substitutions or transformations 
is equal to 1, the m x m determinant 


| rs an | rrr A 
a a a ee =) PH =i, 
om ph Ah. nm 


since the quantities H are s x s determinants complementary to the 
determinants P, and formed from the system of coefficients (h). By 
inversion, one obtains the adjoint determinant 


Gis “Ae RS: da MRE 
eS ee ee ee =S°QK =1, 
gd” oe q™ Kem) sits Km) 


where K denotes the determinant complementary to Q, and if P, Q are 
corresponding determinants then we know that H = Q, K = P. At the 
same time we obtain n equations of the form 


al, = > qo 


Bb 


and s equations of the form 
ie an ka, = 0. 
v1 


The latter equations are a new expression of the original supposition 
that only n of the m numbers a, are independent, and we have been 
able to base all this study on such a system of s equations. 

One can generally shorten the calculation itself by carrying out several 
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elementary transformations simultaneously. Suppose for example that 
m = 4, n = 2, whence s = 2, and 


Q, =21u;, G2 = Tui, +7we, ag = 9w, —3w2, a4 = Bw, + Que, 
giving 


Pes th a ait 
(r) rm, = 21, ry =7, ry’ =9, ry =8, 
m2 =0, rg=7, ry’ =-3, ry" =2. 


We obtain the six partial determinants 


Ri,2 = 147, Riz = —63, Ri = 42, 
R34 = 42, Rog = 42, Rog = —84, 


(R) 
using the abbreviation 
pip) a rit ph) = Ry py 
These determinants satisfy the identity 
Ri2R34 — Ri3Re4+ RijRo,3 = 0. 
Now since the smallest nonzero coefficient of wo is in a4 we form the 
new basis 
Gy = 04 = 21a, B2 = Ag — 304 = —17u + we, 
83 = a3 +204 = 250, + We, Bs = ag = Bw, + Quo, 
whence, conversely 
Oo, =fi, a2 =fet+3f4, a3 =f3-—2f1, a4 = fy. 
Now since w2 has 1 as its smallest nonzero coefficient, in (2 for example, 
we form the third basis 
1 = A) = 21uy, Yo = Be = —17w + we, 
3 = —B2 + B3 = 42u1, Ya = —282 + By = 42u, 
whence, conversely, 


A=n, Be=7e, Bs=7e+73, Ba = 272+ 74. 


At this stage, since 2 is the only number in which wa has a nonzero 
coefficient, and since 7; is the number, amongst the other three, in 
which w has its least coefficient 21, we form the fourth basis 


61 = 1 = 210, 62 = Yo = —17u1 + we, 
63 =-2y+73=0, 64 =—-2y71+ 74 = 0, 
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whence, conversely, 
m=61, ye=62, 73 = 261+63, Ya = 261 +64 =0. 


Since 63 = 64 = 0 the transformation is completed, and successive sub- 
stitutions give 


m1 = 63 = 61, 

a2 = 66, +762 +36, = 66, +76, 
ag = —26, -—362 +63 —-264 = —26; —36o, 
a = 26; +265 +6, = 26; +260, 


and conversely 


6; = Qa) = 21w4, 
6. = a2 —3a,g = —17w, +42, 
63 = -—20, —-ag +03 +50, = 0, 
64 = 2a, —2a2 +7a4 = 0. 


Since 61, 62, 63, 64 are the quantities which, in the general theory, we have 
denoted by a4, a, a{/, af, we have 


(p) Pi =1, PY = 6, py’ = -2, py” = 2, 
p,=0, py=7, py’ =—3, py” =2. 
Thus we obtain, for the determinants proportional to the R, 
(P) Pry2 = 7, Pr 3 —, —3, Pra >. 2, 
P34=2, Poa=—2, Po3 = —4, 


and likewise 
(q) q =1, q =0, qi’ =0, qi” = 0, 
%=0, H=1, a =0, of" =-3 
and 
(Q) { Qi12=1, Q1i3=0, Qi4=-3, 
Q34=90, Qe4=0, Qo3 =0. 


Finally, from the systems of coefficients 


(h) hi =0, hv =0, A’ =1, hi” = 0, 
hs = 0, hg = 3, hg! = —2, hg” ane 1, 

and 

(k) ki = —2, ky = —-1, ki’ = 1, ko aes 5, 
k= 2, ky =—-2, ki’ =0, ky” =7, 


we derive the determinants H,.. = Qu, and Ky = Py, comple- 
mentary to P,,,, and Q,,, respectively, 
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(H) Ai ,2 = hing” z= Agengs, Ai,3 = A{ehg =. hfhy”, Hi = Athy! = AYthg, 
Ha = hia ~Althh, Haag = Rh — hal’, Hag = hi all — hh, 


(K) { Kio = RU KY — RU KY, Keyg = Re RE RRM, Kya = RK — BOR, 
— pp pt — pp pe — pp pays 
K3,4 = ky ky — kiko, Kaa = ky'ky — ky ky’, = Ka,3 = ky kg” — kY"k, 


and the treatment of the example is complete. 

To conclude, I remark that applying this to the case n = 1 leads to 
the fundamental theorem on the greatest common divisor of an arbitrary 
number of rational integers, the foundation for the whole divisibility 


theory of these numbers. 


The researches in this first chapter have been expounded in a special 
form suited to our goal, but it is clear that they do not cease to be true 
when the Greek letters denote not only numbers, but any objects of 
study, any two of which a, @ produce a determinate third element yy = 
a+ @ of the same type, under a commutative and uniformly invertible 
operation (composition), taking the place of addition. The module a 
becomes a group of elements, the composites of which all belong to the 
same group. The rational integer coefficients indicate how many times 
an element contributes to the generation of another. 


2 
Germ of the theory of ideals 


In this chapter I propose, as indicated in the Introduction, to explain in 
a particular case the nature of the phenomenon that led Kummer to the 
creation of ideal numbers, and I shall use the same example to explain 
the concept of ideal introduced by myself, and that of the multiplication 
of ideals. 


§5. The rational integers 


The theory of numbers is at first concerned exclusively with the system 
of rational integers 0,+1,+2,+3,..., and it will be worthwhile to re- 
call in a few words the important laws that govern this domain. Above 
all, it should be recalled that these numbers are closed under addition, 
subtraction and multiplication, that is, the sum, difference and product 
of any two members in this domain also belong to the domain. The 
theory of divisibility considers the combination of numbers under mul- 
tiplication. The number a is said to be divisible by the number b when 
a = bc, where c is also a rational integer. The number 0 is divisible by 
any number; the two units +1 divide all numbers, and they are the only 
numbers that enjoy this property. If a is divisible by b, then +a will 
also be divisible by +b, and consequently we can restrict ourselves to 
the consideration of positive numbers. Each positive number, different 
from unity, is either a prime number, that is, a number divisible only 
by itself and unity, or else a composite number. In the latter case we 
can always express it as a product of prime numbers and - which is the 
most important thing - in only one way. That is, the system of prime 
numbers occurring as factors in this product is completely determined 
by giving the number of times a designated prime number occurs as a 
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factor. This property depends essentially on the theorem that a prime 
divides a product of two factors only when it divides one of the factors. 

The simplest way to prove these fundamental propositions of number 
theory is based on the algorithm taught by Euclid, which serves to find 
the greatest common divisor of two numbers.t This procedure, as we 
know, is based on repeated application of the theorem that, for a positive 
number m, any number z can be expressed in the form gm +r, where 
q and r are also integers and r is less than m. It is for this reason that 
the procedure always halts after a finite number of divisions. 

The notion of congruence of numbers was introduced by Gauss.{ Two 
numbers z, 2’ are called congruent modulo the modulus m, written 


z=2' (mod m), 


when the difference z—z’ is divisible by m. In the contrary case z and 2’ 
are called incongruent modulo m. If we arrange the numbers in classes, 
with two numbers in the same class§ only if they are congruent modulo 
m, then we easily conclude from the theorem recalled above that the 
number of classes is finite and equal to the absolute value of the modulus 
m. This also follows from the studies of the preceding chapter, since the 
definition of congruence in Chapter 1 contains that of Gauss as a special 
case. The system o of all rational integers is identical with the finitely 
generated module [1], and likewise the system m of all numbers divisible 
by m is identical with [m]. The congruence of two numbers modulo m 
coincides with congruence modulo the system m. Thus (by §3,2 or §4,4) 
the number of classes is (0, m) = tm. 


86. The complex integers of Gauss 


The first and greatest step in the generalisation of these notions was 
made by Gauss, in his second memoir on biquadratic residues, when he 
transported them to the domain of complex integers x + yi, where x and 
y are any rational integers and i is /—1, that is, a root of the irreducible 
quadratic equation i? + 1 = 0. The numbers in this domain are closed 
under addition, subtraction and multiplication, and consequently we 
can define divisibility for these numbers in the same way as for rational 


f See, for example, the Vorlesungen tiber Zahlentheorie of Dirichlet. 

} Disquisitiones Arithmeticae, art. 1. 

§ The word class seems to have been employed by Gauss first 4 propos of complex 
numbers. (Theoria residuorum biquadraticorum, II, art. 42.) 
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numbers. One can establish very simply, as Dirichlet showed in a very 
elegant manner,€ that the general propositions on the composition of 
numbers from primes continue to hold in this new domain, as a result of 
the following remark. If we define the norm N(w) of a number w = u+vi, 
where u and v are any rational numbers, to be the product u? + v? of 
the two conjugate numbers u+vi and u—vi, then the norm of a product 
will be equal to the product of the norms of the factors, and it is also 
clear that for any given w we can choose a complex integer q such that 
N(w—q) < 1/2. If now we let z and m be any complex integers, with m 
nonzero, it follows by taking w = z/m that we can put z = gm+r where 
q and r are complex integers such that N(r) < N(m). We can then 
find a greatest common divisor of any two complex integers by a finite 
number of divisions, exactly as for rational numbers, and the proofs of 
the general laws of divisibility for rational integers can be applied word 
for word in the domain of complex integers. There are four units, +1, 
+i, that is, four numbers which divide all numbers, and whose norm 
is consequently 1. Every other nonzero number is either a composite 
number, so called when it is the product of two factors, neither of which 
is a unit, or else it is a prime, and such a number cannot divide a product 
unless it divides at least one of the factors. Every composite number can 
be expressed uniquely as a product of prime numbers, provided of course 
the four associated primes +q, -+-qi are regarded as representatives of the 
same prime number g. The set of all prime numbers g in the domain of 
complex integers consists of: 

1. All the rational prime numbers (taken positively) of the form 4n+3; 

2. The number 1 +i, dividing the rational prime 2 = (1+ i)(1 —%) = 
—i(1 + i); 

3. The factors a+ bi and a — bi of each rational prime p of the form 
4n + 1 with norm a? + b? = p. 

The existence of the primes a + bi just mentioned, which follows 
immediately from the celebrated theorem of Fermat on the equation 
p = a” + 6, and which likewise implies that theorem, can now be de- 
rived without the help of the theorem, with marvellous ease. It is a 
splendid example of the extraordinary power of the principles we have 
reached through generalisation of the notion of integer. 

Congruence of complex integers modulo a given number m of the same 
kind can also be defined in absolutely the same way as in the theory of 
rational numbers. Numbers z, z’ are called congruent modulo m, written 


4 Recherches sur les formes quadratiques & coefficients et a indéterminées complexes 
(Crelle’s Journal, 24). 
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z = 2’ (mod m), when z—2z’ is divisible by m. If we arrange the numbers 
into classes, with numbers being in the same class or not according as 
they are congruent or incongruent modulo m, then the total number of 
different classes will be finite and equal to N(m). This follows very easily 
from the researches of the first chapter, since the system o of all complex 
integers x + yi forms a finitely generated module [1,1] and likewise the 
system m of all the numbers m(z + yi) divisible by m forms the module 
[m, mi], whose basis is related to that of o by two equations of the form 


m=a-1+b-i, mi=—b-1+a-i. 


Consequently we have (§4,4) 


= N(m). 


a b 
a 


(o,m) = =} 


87% The domain o of numbers x + y/—5 
There are still other numerical domains which can be treated in abso- 
lutely the same manner. For example, let 6 be any root of any of the 
five equations 


674+64+1=0, 674+042=0, 
64+2=0, 6-2=0, &-3=0, 


and let x, y be any rational integers. Then the numbers z + y@ form a 
corresponding numerical domain. In each of these domains it is easy to 
see that one can find the greatest common divisor of two numbers by a 
finite number of divisions, so that one immediately has general laws of 
divisibility agreeing with those for rational numbers, even though there 
happen to be an infinite number of units in the last two examples. 

On the other hand, this method is not applicable to the domain o of 
integers 


w=2+y0 
where @ is a root of the equation 
0 +5=0, 


and 2, y again take all rational integer values. Here we encounter the 
phenomenon which suggested to Kummer the creation of ideal numbers, 
and which we shall now describe in detail by means of examples. 
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The numbers w of the domain o we shall now be concerned with are 
closed under addition, subtraction and multiplication, and we therefore 
define the notions of divisibility and congruence of numbers exactly as 
before. Also, if we define the norm N(w) of a number w = z + y@ to be 
the product x? +5y of the two conjugate numbers x+y, then the norm 
of a product will be equal to the product of the norms of the factors. 
And if py is a particular nonzero number we conclude, just as before, 
that N() expresses how many mutually incongruent numbers there are 
modulo yw. If « is a unit, and hence divides all numbers, then we must 
have N() = 1 and therefore » = +1. 

A number (different from zero and +1) is called decomposable when it 
is the product of two factors, neither of which is a unit. In the contrary 
case the number is called indecomposable. Then it follows from the 
theorem on the norm that each decomposable number can be expressed 
as the product of a finite number of indecomposable factors. However, 
in infinitely many cases an entirely new phenomenon presents itself here, 
namely, the same number is susceptible to several, essentially different, 
representations of this kind. The simplest examples are the following. 
It is easy to convince oneself that each of the following fifteen numbers 
is indecomposable. 


a=2, b= 3, c=7; 

bb =-2+0, bb =-2-60; cc =24+30, co =2- 30; 
dj =1+80, dg =1-9; €:=3+06, eg =3-9; 
fi =—-1+26, fo=-1-—20; gp =4+0, go=4-89. 


In fact, for a rational prime p to be decomposable, and hence of the 
form ww’, it is necessary that N(p) = p? = N(w)N(w’), and since w, 
w’ are not units we must have p = N(w) = N(w’), that is, p must 
be representable by the binary quadratic form x? + 5y?. But the three 
prime numbers 2, 3, 7 cannot be represented in this way, as one sees from 
the theory of these forms,} or else by a small number of direct trials. 
They are therefore indecomposable. It is easy to show the same thing, 
similarly, for the other twelve numbers, whose norms are products of two 
of these three primes. However, despite the indecomposability of these 
fifteen numbers, there are numerous relations between their products, 
which can all be deduced from the following: 


(1) ab = dido, b? = bi be, ab, = d3, 


{ See Dirichlet’s Vorlesungen tiber Zahlentheorie, §71. 
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(2) ac = e€1€9, ce? =c1e2, ac, = e?, 
(3) be = fife = 9192, af, = d\e1, ag, = dyeé2. 


In each of these ten relations, the same number is represented in two or 
three different ways as a product of indecomposable numbers. Thus one 
sees that an indecomposable number may very well divide a product 
without dividing any of its factors. Such an indecomposable number 
therefore does not possess the property which, in the theory of rational 
numbers, is characteristic of a prime number. 

If we imagine for a moment that the fifteen preceding numbers are 
rational integers then, by the general laws of divisibility, we easily deduce 
from the relations (1) that there are decompositions of the form{ 


a=po*?, d,=pah;, d2 = pape, 
b= HB; Bo, by = uB?, be aa u83, 


and from the relations (2) that there are decompositions of the form 


a= pa, €) =pWa'y, €2 = pay, 
yf = hd he, 
cCHeUNY2 A =H, C2 =-Y2; 


where all the Greek letters denote rational integers. And it follows im- 

mediately, by virtue of the equation pa? = pa! _ that the four numbers 

fi, fe, 91, g2 appearing in the relations (3) will likewise be integers. 

These decompositions are simplified if we make the additional assump- 

tion that a is prime to 6 and c, since this implies uw = p’ = 1, a = a’ 

and hence the fifteen numbers can be expressed as follows, in terms of 

five numbers a, (1, 82, ¥1, Y2: 

t Since these decompositions do not seem obvious to me, I include the following 
proof of the consequences of (1) as an example. Note first that abi = ad and 
bi bz = b? are both squares. Suppose that 

a=po’, b1=116?, be = u23, 
where 4, 141, #2 are squarefree. Then abi = pp.a? B2 is not a square unless 
f= 1. Similarly, 6162 is not a square unless 441 = 2. Thus in fact wp = uw. = po 
and hence 

a=pa", b= "G?, be = p63. 

Forming products of these, we get 

d? = ab; = p?a7B? = di = pop, 

a3 = abg = p07 62 > de = pape, 

b? = bib = 6763 => b= uP: h2, 
which completes the proof of the decompositions claimed by Dedekind. (Transla- 
tor’s note.) 
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a=a"?, b=fAifo, c= N723 

(4) b= 62, b=; a=%, e=% 
d,=af;}, d,=afo; eae: =ayi, €2 = a2; 
fi=Bin, fe = Gere; 9 =fiye, ge= hen. 


Now even though our fifteen numbers are in reality indecomposable, the 
remarkable thing is that they behave, in all questions of divisibility in 
the domain 0, exactly as if they were composed, in the manner indicated 
above, of five different prime numbers a, (1, G2, Y1, Y2- In a moment I 
shall explain in detail what these relations between numbers mean. 


§8. Role of the number 2 in the domain o 


Let me begin by remarking that, in the theory of rational integers, one 
can recognise the essential constitution of a number without effecting 
its decomposition into prime factors, observing only how it behaves as 
a divisor. If we know, for example, that a positive number a does not 
divide a product of two squares unless at least one of the squares is 
divisible by a, then we can conclude with certainty that a is either 1, 
a prime or the square of a prime. It is likewise certain that a number 
@ must contain at least one square factor, other than unity, when we 
can prove the existence of a number not divisible by a, whose square is 
divisible by a. Thus if we can ascertain that both these two properties 
hold for a, then we can conclude with certainty that a is the square of 
a prime number. 

We shall now examine the behaviour, in this sense, of the number 2 in 
our domain o of numbers w = x + y@. Since any two conjugate numbers 
are congruent modulo 2 we have 


w? = N(w) (mod 2), 


and hence also w2w’? = N(w)N(w’) (mod 2). Now, if the number 2 is to 
divide the product w?w’ om and hence also the product of the two rational 
numbers N(w), N(w’), it is necessary for at least one of these norms, and 
hence also for at least one of the two squares w, w’ 2’ to be divisible by 
2. Moreover, if we take x, y to be any two odd numbers, then we obtain 
a number w = x + y@ not divisible by 2, and whose square is divisible 
by 2. Having regard to the preceding remarks on rational numbers, we 
then say that the number 2 behaves in our domain o as though it were 
the square of a prime number a. 
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Although such a prime number a does not actually exist in the domain 
o, it is by no means necessary to introduce it, since in fact Kummer 
managed in similar circumstances with great success by taking such a 
number a to be an ideal number, and we are allowing ourselves to be 
guided by analogy with the theory of rational numbers to define the 
presence of the number a in terms of existing numbers w of the domain 
o. Now, when a rational number a is known to be the square of a prime 
a@ we can easily judge, without bringing in a, whether a is a factor of an 
arbitrary rational integer z, and how many times. It is clear that z is 
divisible by a” if and only if z? is divisible by a”. Thus we extend the 
criterion to the case we are interested in by saying that a number w of 
the domain o is divisible by the nth power a” of the ideal prime number 
a when w? is divisible by 2”. Experience will show that this definition is 
very luckily{ chosen, because it leads to a mode of expression in perfect 
harmony with the laws of the theory of rational numbers. 

It follows first, for n = 1, that a number w = x + y@ is divisible by a 
if and only if N(w) is an even number, and consequently 


(a) x =y (mod 2). 
The number w is not divisible by a when N(w) is an odd number, and 


consequently x = 1+y (mod 2). From this we get the theorem expressing 
the character of the ideal number @ as a prime number: 


“The product of numbers not divisible by a is also not divisible by 


” 


a. 


As far as higher powers of a are concerned, we first conclude from 
the definition that a number w divisible by a” is also divisible by all 
lower powers of a, because a number w” divisible by 2” is also divisible 
by all lower powers of 2. We now have to find, when w is nonzero, the 
highest power a™ of a@ that divides w, that is, the highest power of 2 
that divides w?. Let s be the exponent of the highest power of 2 that 
divides w itself. We have 


w = uy = 2°(21 + 919), 
and at least one of the two rational integers 21, y; will be odd. If both 
are odd, w, will be divisible by a and we shall have 
wi = 2? — By? + 221410 = Qu, 
+ Luckily, since, for example, trying analogously to determine the role of the number 


2 in the domain of numbers x + y\/—3 leads to complete failure. Later we shall 
clearly see the reason for this phenomenon. 
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where w2 = £2 + y20 is not divisible by a, because x2 is even and y2 is 
odd. But if one of the two numbers 71, y; is even, so that the other is 
odd, then uw; and consequently w? will not be divisible by a. Thus in 
the first case m = 2s +1, in the second case m = 2s, but in both cases 
w* = 2™w’ where w’ is a number not divisible by a. We see at the same 
time that m is also the exponent of the highest power of 2 that divides 
the norm N(w). We therefore have the theorem: 


“The exponent of the highest power of a that divides a product is 
equal to the sum of the exponents of the highest powers of a that divide 
the factors”. 


It is likewise evident that each number w divisible by a?” is also 
divisible by 2” since, if the exponent denoted above by s is < n, then 
the numbers 2s, 2s + 1 and hence also m will be < 2n, contrary to 
hypothesis. It follows immediately from the definition that, conversely, 
each number divisible by 2” is also divisible by a2”. 

Since the number 1 + 9 is divisible by a, but not by a”, we easily 
see, with the help of the preceding theorem, that the congruence w? = 0 
(mod 2”) that serves to define divisibility of the number w by a” can be 
replaced by the congruence 


(a”) w(1 +0)” =0 (mod 2"), 


which has the advantage of containing the number w only to the first 
power. 


§9. Role of the numbers 3 and 7 in the domain o 


When all the quantities appearing in the equations (4) of §7 are rational 
integers, and if at the same time a is prime to b and c, then it is evident 
that a rational] integer z will be divisible by 61, 62, 71, y2 according as 
it satisfies the corresponding congruences 


zdg=0, 2zd,=0 (mod d), 

zeg =0, zey=0 (modc). 
These congruences have the peculiarity that they do not involve the 
numbers (1, 82, 71, Y2 themselves, and it is for precisely this reason 


that they are appropriate for introducing the four ideal numbers (, (2, 
71, Y2 in the context of numbers in the domain 0. We say that a number 


92 Chapter 2. Germ of the theory of ideals 


w=2+y6 is divisible by one of these four numbers if w is a root of the 
corresponding congruence 


(1-@w=0, (1+6)w=0 (mod 3), 
(3-0)w=0, (3+0)w=0 (mod 7). 


Multiplication converts these congruences to the following: 


(81) xz=y (mod 3), 
(G2) =-y (mod 3), 
("1) 2=3y (mod 7), 
(72) x2 =-3y (mod 7), 


concerning which we add the following remarks. 

Each of these conditions can be satisfied by one of the numbers w = 
1+6,1-—0,3+6, 3—86, and the number in question does not satisfy 
any of the other three, so that it is legitimate to say that the four ideal 
numbers are all different. Moreover, since every number w divisible by 
1 and 2 is also divisible by 3, since x = y = —y = 0 (mod 3) in that 
case, and since conversely every number divisible by 3 is also divisible 
by (1 and (2, we ought to regard 3 as the least common multiple of the 
ideal numbers (;, G2, by analogy with the theory of rational numbers. 
But each of these two ideal numbers also has the character of a prime 
number, that is, it does not divide a product ww’ unless it divides at 
least one of the factors w, w’. In fact if we put 


w=r2t+yO, w=a2'+y6, w=! =27"+y"8, 


then we have 
gl" = az! _ 5yy’, y” = zy’ + yx’, 


and hence 
x" ty" =(r@ty)(x’+y’') (mod 3), 


which immediately justifies our assertion, bearing in mind the congru- 
ences (1), (2) above. Because of this, the number 3 ought to be con- 
sidered, from a certain point of view, as the product of the two different 
ideal prime numbers (,, Be. 

Moreover, since each of the ideal prime numbers (1, 2 is different (in 
the sense indicated above) from the ideal prime number a introduced 
above, then in view of the fact that 2 behaves like the square of a@ and 
1+ @ is divisible by a and £,, and 1 — 0 by a and fe, we ought to 
conclude from the equation 2-3 = (1+ 6)(1 — 6) that 1+ behaves like 
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the product of a and {;, and 1 — @ like the product of a and (2. In fact 
this presumption is plainly confirmed: each number w = z+ y@ divisible 
by 1+ @ is in fact divisible by a and (, because 


x+y0 = (1+ 6)(2'+y'8) 


implies 
g=a2'—-5y, y=a'ty, 
and consequently 
xz = y (mod 2), x =y (mod 3). 


Conversely, each number w = x + y@ divisible by a and (1, that is, sat- 
isfying the two preceding congruences, is also divisible by 1+ 6, because 
we have y = x + 6y’ and consequently 


z+ yO = (1+ 6)(a + 5y’ + y'6). 


We can now also introduce the powers of the ideal prime numbers (1, 
2, as we have done above for powers of the ideal number a. By analogy 
with the theory of rational numbers, we define divisibility of an arbitrary 
number w by (7 or BF by the respective congruences 


(47) w(1— 6)" =0 (mod 3”), 
(63) w(1 +0)" = 0 (mod 3"), 


and this yields a series of theorems which agree perfectly with those of 
the theory of rational numbers. We treat the ideal prime numbers 7, 
‘2 in the same way. 


810. Laws of divisibility in the domain o 

By similar study of the whole domain o of numbers w = x + y@ we find 
the following results: 

1. All the positive rational primes = 11,13,17,19 (mod 20) behave 
like actual prime numbers. 

2. The number @ with square —5 has the character of a prime number. 
The number 2 behaves like the square of an ideal prime number a. 

3. Each positive rational prime = 1,9 (mod 20) can be decomposed 
into two different factors, which really exist and have the character of 
primes. 
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4, Each positive rational prime = 3,7 (mod 20) behaves like the 
product of two different ideal prime numbers. 

5. Each actual number w different from zero and +1 is either one of 
the numbers mentioned above as having the character of a prime, or else 
it behaves in all questions of divisibility as a unique product of actual 
or ideal prime factors. 

However, to arrive at this result and to become completely certain 
that the general laws of divisibility governing the domain of rational 
numbers extend to our domain o with the help of the ideal numbers we 
have introduced,f it is necessary, as we shall soon see when we attempt 
a rigorous derivation, to make a very deep investigation, even supposing 
knowledge of the theory of quadratic residues and binary quadratic forms 
(a theory which, conversely, can be derived with great facility from the 
general theory of algebraic integers). We can indeed reach the proposed 
goal with all rigour; however, as we have remarked in the Introduction, 
the greatest circumspection is necessary to avoid being led to premature 
conclusions. In particular, the notion of product of arbitrary factors, 
actual or ideal, cannot be exactly defined without going into minute 
detail. Because of these difficulties, it has seemed desirable to replace 
the ideal number of Kummer, which is never defined in its own right, 
but only as a divisor of actual numbers w in the domain 0, by a noun for 
something which actually exists, and this can be done in several ways. 

One can, for example (and if I am not mistaken, this is the way chosen 
by Kronecker in his researches), replace the ideal numbers by actual 
algebraic numbers, not from the domain 0, but rather adjoined to this 
domain in the sense of Galois. Indeed, if we put 


Ai =V-2+48, fo = V-2-8, 
and if we choose the square roots so that (12 = 3 then we have 


@P=-5, Pi=-2+0, Sy =-2-8, 


Pipe =3, 02; =—26, —3f2, O62 = 36; + 2fo, 
whence it follows that the quadrinomial numbers 


2+ yO + 2131 + z2fe, 


t To some people it seems evident a priori that the establishment of this harmony 
with the theory of rational numbers can be imposed, whatever happens, by the 
introduction of ideal numbers. However the example, given above, of the irregular 
role of the number 2 in the domain of numbers z + y./—3 suffices to dispel this 
illusion. 
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where z, y, 21, 22 are rational integers, are closed under addition, sub- 
traction and multiplication. The domain o’ of these numbers contains 
the domain o, and all the ideal numbers needed for the latter can be 
replaced by actual numbers of the new domain o’. For example, by 
putting 

a=fi+ Pe, 1 =201+ 62, Y2= Git 262 


all the equations (4) of §7 are satisfied. Likewise, the two ideal prime 
factors of the number 23 in the domain o are replaced by the two actual 
numbers 23, — G2 and —@; + 2G of the domain o’, and it is the same 
for all the ideal numbers of the domain o. 

Although this way is capable of leading to our goal, it does not seem 
to me as simple as desirable, because one is forced to pass from the given 
domain o to a more complicated domain o’. It is also easy to see that 
the choice of the new domain o’ is highly arbitrary. In the Introduction 
I have explained in detail the train of thought that led me to build this 
theory on quite a different basis, namely on the notion of ideal, and it 
would be superfluous to come back to it here; hence I shall confine myself 
to illustrating the notion by an example. 


§11. Ideals in the domain o 
The condition for a number w = x + y@ to be divisible by the ideal 
prime number a is that x = y (mod 2), by §8. Thus to obtain the 
system a of all numbers w divisible by a we put x = y+2z, where y and 
z are arbitrary rational integers. The system a therefore consists of all 
numbers of the form 2z+(1+6)y, that is, a is a finitely generated module 
with basis consisting of the two independent numbers 2 and 1+ 0, and 
consequently 
a = (2,1+ 6]. 

Similarly letting b,, be, c1, cg denote the systems of all numbers w divis- 
ible by (1, Be, 71, Ye respectively, we conclude from the corresponding 
congruences in §9 that 


b; = [3,1 + 4], bz = [3,1 — 4], 
4 =(7,34+6], c2=[7,3- 9]. 


If we now let m denote any one of these five systems, then m enjoys 
the following properties. 
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I. The sum and difference of any two numbers in m are also in m. 

II. Each product of a number in the system m by a number in the 
system o is a number in the system m. 

The first property, characteristic of each module, is evident. To es- 
tablish the second property for a system m whose basis consists of the 
numbers p, p’ it evidently suffices to show that the two products Oy, Oy! 
belong to the same system. For the system a this follows from the two 
equations 


20 = -1-242(1+6), (1+6)0=—-3-24+(1+6), 


and it is just the same for the other systems. But these two properties 
can also be established without these verifications, by appealing to the 
fact that each of the five systems m is the set of all numbers w in the 
domain o satisfying a congruence of the form 


vw = 0 (mod p), 


where jt, v are two given numbers in the domain o. 

We now call any system m of numbers in domain o enjoying properties 
I and II an ideal, and we begin by posing the problem of finding the 
general form of all ideals. Excluding the singular case where m consists 
of the single number zero, we choose an arbitrary nonzero number y in 
m. Then if y’ denotes the conjugate number, the norm N(p) = py’, 
and hence the product 9N(y) also belongs to the ideal m by virtue of II. 
Thus all the numbers in the module o = [1,6], when multiplied by the 
nonzero rational number N (jz), become numbers in the module m, which 
is at the same time a multiple of 0. But it is a consequence of (§3,2) that 
m is a finitely generated module, of the form [k,1+m6] where k, |, m are 
rational integers, among which k and m can be chosen positive. Since m 
already has property I, as a module, the question is what follows from 
property II, which says that the two products k@ and (J + m6)@ belong 
to the system m. The necessary and sufficient conditions for this, as one 
sees without difficulty, are that m divide k and | and that the rational 
integers a, b appearing in the expression 


m = [ma,m(b+ 4)] 
also satisfy the congruence 
b? = —5 (mod a). 


If we replace b by any number = b (mod a) then the ideal m is unchanged. 
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The five ideals above, a, 63, be, ¢1, ¢2 are evidently of this form, since 
(b+) can also be replaced by —(b + 8). 

The set of all numbers conjugate to the numbers in an ideal m is 
evidently also an ideal 


m, = [ma, m(—b + 6)]. 


Two such ideals m, m, may be called conjugate ideals. 

Let y be any number in the domain o. The system [p, 16] of all 
numbers divisible by 4 forms an ideal which we call a principal ideal,t 
and which we denote by o(j) or ou. It is easy to give it the above 
form [ma,m(b+ 6)]; m is the greatest rational integer that divides p = 
m(u + v@) and we have, moreover, 


aa Nw) 


pr? 


vb = u (mod a). 


Thus we find, for example, 
o(+1) = o = {1, 6], 
and 
0(2) = [2,26], 0(3) = [3,36], 0(7) = [7, 76], 
o(1+6) = [6,+1+6], o(3+6] = [14,+3 + 6], 
o(-2+6) =[9,F2+6], 0(2+30) = [49,417 + 4}, 
o(—1 +20) = [21,4104 6], 0(440) = (21,44+ 9]. 
Since all ideals are also modules, we say (following §2,1) that two num- 
bers w, w’ are congruent modulo the ideal m, and put w = w’ (mod m) 
when w — w’ is a number in m. The norm N(m) of the ideal m = 
[ma, m(b + 6)] is the number 
(o,m) = m?a 


of classes into which the domain o is partitioned modulo the module m 
(§4,4). If m is a principal ideal oy then the preceding congruence will 
be equivalent to w = w’ (mod yz) and we shall have 


N(m) = N(u). 
The norm of any number m(az+(b+6)y) in the ideal m = [ma, m(b+4)] 
{ If we extend the definition of ideal to the domain o of rational integers, or to the 
complex integers of Gauss, or to any of the five domains o considered in §7, then 


one easily sees that every ideal is a principal ideal. It is also evident that, in the 
domain of rational integers, property II is already contained in property I. 
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is equal to the product of N(m) = ma with the binary quadratic form 
axz?+2bry+cy? whose determinant, according to the definition of Gauss, 
is b? —~ac = —5.t 


§12. Divisibility and multiplication of ideals in o 
T shall now show how the theory of numbers w = x + y@ in the domain 
m can be based on the notion of ideal. For the sake of brevity, however, 
I shall be obliged to leave certain easy calculations to the reader. 

Just as in the theory of modules (§1,2), we say that an ideal m” is 
divisible by an ideal m when all numbers in m” belong to m. It follows 
that a principal ideal oy” is divisible by a principal ideal op if and only 
if the number yu” is divisible by the number y. Thus the theory of 
divisibility of numbers is contained in the theory of divisibility of ideals. 
One sees immediately that the necessary and sufficient conditions for 
the ideal m’ = [m’a”,m’(b” + 0)] to be divisible by the ideal m = 
[ma, m(b + 6)] are the three congruences 


m’a=m"a" = m"(b" —b) =0 (mod ma). 


The definition of multiplication of ideals is the following: if » runs 
through the numbers in the ideal m, and yp’ through the numbers in 
the ideal m’, then all the products uy’ and their sums form an ideal 
m” called the product} of the factors m, m’ and denoted by mm’. We 
evidently have 


om=m, mm =m'm, (mm’)n = m(m’n), 


whence it follows that products of any number of ideals satisfy the same 
theorems as products of numbers.{ Moreover, it is clear that the product 
of two principal ideals ou and op’ is the principal ideal o(yp’). 

Now given two ideals 

m = [ma,m(b+9)],  m’ = [m’a’,m'(b’ + 8)], 
we derive their product 
m” = mn’ = [m’a", m’(b" + 6)] 

¢ The general theory of forms is nevertheless simplified a little when we also admit 

the forms Az? + Bry + Cy?, where B is odd, and if we always understand the 

determinant of the form to be the number B? — 4AC. 


t The same definition also applies for multiplication of two modules. 
t See Dirichlet’s Vorlesungen tiber Zahlentheorie, §2. 
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with the help of the methods indicated in the first chapter (§4, 5 and 6). 
It is clear from the definition that the product mtv’ is a finitely generated 
module with basis consisting of the four products 
mm’'aa’, mm’'a(b' +6), mm’a’(b+ 9), 

mm’ (b + 6)(b! + 6) = mm’ [bb' — 5 + (b+ 0’) 4], 
of which only two are independent. Thus for the ideals considered above, 
for example 

bj = [3,146], co =(7,3—- 6], 


we find the product 
bite = [21,9 — 30,7 + 76,8 + 26]. 


This module is derived from the one considered at the end of the first 
chapter (§4,6), and by setting w, = 1, w2 = 8 we conclude 


bycq = (21, -17 + 6] = [21,4 + 6] = 0(4+ 8). 


In the same way we obtain all the following results, completely analogous 
to the hypothetical equations (4) of §7: 
0(2)=a, 0(3)=6,b2, 0(7) = e109; 
o(—2+6)=67, o(-2-—6) = 63; 
0(2+30)=c?, 0(2—30)= 2; 
o(1+6)=ab,, o0(1—0) = abo; 
0(34+6)=acy, 0(8— 4) = acg; 
o(-—14+ 26)=6ie,, o0(—1 — 26) = boe9; 
0(4+6)=bicg, 0(4— 6) = bey. 
To effect the multiplication of two ideals; m, m’ in general it is necessary 
to transform the basis of the four numbers above into one consisting of 
only two numbers ma", m’"(b"” + @). One arrives at this (by virtue of 
§4) via four equations of the form 
mm'aa’ = pm'a"” + qm" (b" + 0), 
mm’a(b! + 6) = p'm"a" + q’m"(b" + 6), 
mm’a'(b + 0) = p"m"a" + qm" (b” + 8), 
mm’ (bb! — 5 + (b+ b')6] = pm" a" + q!’m'"(b" + 8), 


where p,p’,...,q’” denote eight rational integers chosen so that the six 
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determinants formed from them, 


P= pq’ —qp’, Q=pq" — ap", R=pq” — ap", 
Jif 


U = pq” _ q'p", Ta=pq"-dp", S= pq" - gp” 


have no common divisor. From the four preceding equations, each of 
which decomposes into two, we now conclude without difficulty that 
these six determinants are respectively proportional to the six numbers 
a, a, +5, 
ce, c¢, vb —6b, 


where c and c’ are determined by the equations 
bb — ac = b'b! — a’! = -5. 


But, since these six numbers admit no common divisor,{ they must 
precisely coincide with the six determinants. It follows, since gq = 0 
and q’, g”, q’” can have no common divisor, that we can determine the 
product m” = mm’ of two given factors m, m’ as follows. Let p be the 
greatest common (positive) divisor of the three given numbers 


a=pq’, a'=pq’, b+b'=pq"". 
We have 
m” = pmm’, a= y7=49¢, 


and b” is determined by the congruences 


5 


7b” = qb’, qb” = qb, g”b" = (mod a’) ‘ 


At the same time we have 6b" = —5 (mod a”), that is 

b’b” —a"c’ = —5 
where c” is a rational integer and, to use a terminology employed by 
Gauss,} the binary quadratic form (a”,b”,c”’) is composed from the two 


forms (a, b,c) and (a’,b’,c’). 
The values of m”, a” yield m 


N(mm’) = N(m)N(m’). 


2 2 
"“q"' = m?a-m'“a’, and hence the theorem 


{ This will not always be so in the domain of numbers x + y/—3. 
t Disquisitiones Arithmeticae, art. 235, 242. 
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It is also necessary to note the special case where m’ is the ideal m; 
conjugate to m. The preceding formulas then yield the immediate result 


mm, = oN(m). 


The two notions of divisibility and multiplication of ideals are now con- 
nected in the following manner. The product mm’ is divisible by both 
m and m/’ since, by property II of ideals, all the products yy’ whose 
factors belong respectively to m, m’ are in both these ideals; the same 
then holds for the product ideal itself. Conversely, if the ideal m” = 
[m"a", m'(b" + 0)] is divisible by the ideal m = [m, m(b+ 6)] then there 
is exactly one ideal m’ such that mm’ = m”. In fact, if we let m, denote 
the ideal conjugate to m and form the product 


mym” — [m!"a’, m'"(b' + 6)] 


by the preceding rules, then it follows from the three congruences estab- 
lished at the beginning of this § that m’” is divisible by N(m) = m?a, 
and hence that m’” = m?am/’, where m’ is an integer. Combining this 
with the preceding theorem that mm; = 0(m?a), we easily conclude 
that the ideal m’ = [m’a’, m’(b’ + 6)], and it alone, satisfies the condition 
mm’ = m”. At the same time it follows that the equation mn’ = mm” 
always implies m’ = m’”. 

Now to arrive at the conclusion of this theory, it only remains to 
introduce the following notion. An ideal p, different from o and divisible 
by no ideals other than o and p, will be called a prime ideal. If 7 is a 
particular number, then the system t of all roots p of the congruence 
np = 0 (mod p) forms an ideal, because it has properties I and II. This 
ideal t is a divisor of p, because all numbers in p are also roots of this 
congruence. Thus, if p is a prime ideal, then t must be o or p. If the 
given number 7 is not in p then the number 1, in o, will not be a root 
of the congruence, and hence in this case t will be neither o nor p. That 
is, all the roots p must be in p. Thus we have established the following 


theorem:t 


“A product np of two numbers 7, p is not in a prime ideal p unless at 
least one of the factors is in p.” 


And this immediately yields the theorem: 


{ This theorem leads easily to the determination of all prime ideals contained in 0, 
and they correspond exactly to the prime numbers, actual and ideal, enumerated 
in §10. 
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“If neither of the ideals m, m’ is divisible by the prime ideal p then 
their product will also not be divisible by p.” 


Because if m, m’ respectively include numbers yp, p’ not in p, then mm’ 
includes the number py’ not in p. 

Combining the theorem just proved with the preceding theorems on 
the connection between divisibility and multiplication of ideals, and 
bearing in mind that o is the only ideal with norm 1, we arrive, by 
exactly the same reasoningf as in the theory of rational numbers, at the 
following theorem: “Each ideal different from o is either a prime ideal 
or else uniquely expressible as a product of finitely many prime ideals.” 
It follows immediately from this theorem that an ideal m” is divisible by 
an ideal m if and only if all the powers of prime ideals that divide m also 
divide m”. If m = op and m” = op” are principal ideals then the same 
criterion also decides the divisibility of the number py” by the number p. 
Thus the theory of divisibility of numbers in the domain o is restored to 
firm and simple laws. 

All this theory can be applied almost word for word to any domain 
o consisting of all the integers of a quadratic field 2, when the notion 
of integer is defined as in the Introduction.{ However, even though 
this approach to the theory leaves nothing to be desired in the way of 
rigour, it is not at all what I propose to carry out. One notices, in 
fact, that the proofs of the most important propositions depend upon 
the representation of an ideal by the expression [ma,m(b + @)] and on 
the effective realisation of multiplication, that is, on a calculus which 
coincides with the composition of binary quadratic forms given by Gauss. 
If we want to treat fields 2 of arbitrary degree in the same way, then 
we shall run into great difficulties, perhaps insurmountable ones. Even 
if there were such a theory, based on calculation, it still would not be 
of the highest degree of perfection, in my opinion. It is preferable, as 
in the modern theory of functions, to seek proofs based immediately on 
fundamental characteristics, rather than on calculation, and indeed to 
construct the theory in such a way that it is able to predict the results 
of calculation (for example the composition of decomposable forms of all 
degrees). Such is the goal I shall pursue in the chapters of this memoir 
that follow. 


t See Dirichlet’s Vorlesungen tiber Zahlentheorie, §8. 

{ The domain, mentioned above, of numbers z + y/—3 where z, y are rational 
integers is not a domain of this nature. However, it forms only a part of the domain 
o of all the numbers z + yp, where p is a root of the equation p* + p+1=0. 
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General properties of algebraic integers 


In this chapter we first consider the domain of all algebraic integers. 
Then we introduce the notion of a field 2 of finite degree, and determine 
the constitution of the domain o of integers of the field 2. 


§13. The domain of all algebraic integers 
A real or complex number @ is called algebraic when it satisfies an equa- 
tion 
6” + 0,0"! +420"? + --- +. Gn_10 + an = 0 

of finite degree n with rational coefficients a1, a2,...,@n-1,@n. If the 
coefficients of this equation are rational integers, that is, numbers from 
the sequence 0, +1, +2,..., then @ is called an algebraic integer, or simply 
an integer. It is clear that the rational integers are also algebraic integers 
and, conversely, if a rational number @ is at the same time an algebraic 
integer then, by virtue of a known theorem, it will also be one of the 
rational integers 0,+1,+2,.... From the definition of integers we easily 
derive the following propositions: 

1. The integers are closed under addition, subtraction and multipli- 
cation, that is, the sum, difference and product of any two integers a, 3 
are also integers. 


Proof. By hypothesis, there are two equations of the form 
$(a) = a* + pia? +--+ +pa-10 + Pa = 0, 
$(8) = 6° +n 8°? +++» +H-18 +o = 0, 


in which all the coefficients p, g are rational integers. We now put ab = n 
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and let 

W1,W2,..-, Wn 
denote the n products at Be formed from one of the a numbers 
a1 


1,a,07,...,a 


and one of the b numbers 


1, B, 6’, eet wo. 
If w now represents one of the three numbers a + 3, a — G3, af then we 
easily see that each of the n products ww 1,ww,...,WwW,, can be reduced 


immediately, with the help of the equations ¢(a) = 0, #() = 0, to the 
form 
kw + kow2 a knwn, 
where kj, k2,...,kpy, are rational integers. We then have n equations of 
the form 
wu = kia + khwet-:-+kiwn, 
ww = kia + kgwe t+++ + kw, 


wy = ke wy + Ke we $2 $M Wy, 


all coefficients k of which are rational integers. Now by elimination of 


the n numbers w1,w2,...,W,, which include the nonzero number 1, we 
derive the equation 
ki-—w ke ki 
ee 
a 


which is evidently of the form 
w" + ew 14 ---+en_ iw +en =0, 


where the n coefficients e are formed from the numbers k by addition, 
subtraction and multiplication, and hence are rational integers. Thus 


w, and consequently each of the three numbers a + 3, a — 3, af, is an 
integer. Q.E.D. 


2. Each root w of an equation of the form 


F(w) =w™ +aw™ 14 Bw 2 4.---+€=0, 
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where the coefficient of the highest degree term is unity and the others 
a, 3,...,€ are integers, is likewise an integer. 


Proof. By hypothesis, the coefficients a, 3,...,€ are roots of equations 
¢(a) =a°+pia**+...+ pa =0, 
$(8) = BP +qB°1+...+4 =0, 


where all the coefficients p,g,...,s are rational integers. Now if we put 
n = mab---e and let w1,we,...,wW, denote all the n products of the 
form 


é 


w™ a2’ Be eee ; 
where the exponents are rational integers satisfying 
O<m'<m, O<a'<a, 0<U <b ..., O<e' <e, 


then it is easily seen that, with the help of the equations F(w) = 0, 
o(a) = 0, ¥(8) = 0, ..., x(e) = 0, all the products wu, wwe,...,WWn 
reduce immediately to the form 


kywy + kqwe +--+ knwn, 


where kj, ke,..., kn are rational integers. It then follows, as in the pre- 
vious proof, that w is an integer. Q.E.D. 


It follows from the latter theorem that, for example, if a is any integer 
and r, s are positive rational integers then ¥/a” is also an integer. 


§14. Divisibility of integers 
We say that an integer a is divisible by an integer G when a = By 
for some integer y. The same thing is expressed by saying that a is a 
multiple of 3, or that 6 divides a, or that 6 is a factor or divisor of a. 
It follows from this definition and Theorem 1 of §13 that we have the 
following two elementary propositions, mentioned in the Introduction. 


1. If a, a’ are divisible by 4 then a + a’, a — a’ are also divisible by 
B, 
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2. If a’ is divisible by a and a is divisible by : then a’ is also divisible 
by yp. 


However, it is necessary to pay particular attention to units, that is, 
to the integers that divide all integers. A unit « must therefore divide 
the number 1, and conversely it is evident that every divisor € of the 
number 1 is a unit, because every integer is divisible by 1, and hence 
(by virtue of proposition 2 above) also by «. At the same time we see 
that each product and quotient of two units is itself a unit. 

If each of two nonzero integers a, a’ is divisible by the other then 
we have a’ = ae, where € is a unit. Conversely, if « is a unit then 
each of the two integers a and a’ = ae is divisible by the other. We 
call two numbers a, a’ associates if they are related in this way, and 
it is clear that any associates of a third number are associates of each 
other. In all questions concerning just divisibility, associates behave as 
a single number. Indeed, if a is divisible by 8, then every associate of a 
is divisible by every associate of (. 

A deeper investigation will enable us to see that two nonzero integers 
a, 3 have a greatest common divisor, which can be put in the form 
aa’ + 88’ where a’ and # are integers. This important theorem is 
not at all easy to prove with the help of the principles developed thus 
far, but we shall later (§30) be able to derive it very simply from the 
theory of ideals. I shall therefore end these preliminary considerations of 
the domain of all integers with the remark that there are absolutely no 
numbers in this domain with the character of prime numbers. Because, 
if a is a nonzero integer, and not a unit, then we can decompose it in 
infinitely many ways into factors which are integers but not units. For 
example, we have a = ,/a- \/a and also a = (3182 where (3;, G2 are the 
two roots (@ of the equation 6? — 8+ a = 0. It follows from Theorem 2 
of §13 that ./a, 1, G2 are integers as well as a. 


§15. Fields of finite degree 


The property of being decomposable in infinitely many ways, whose pres- 
ence has just been pointed out in the domain of all integers, disappears 
again when the integers under consideration are confined to a field of 
finite degree. First we have to define the extent and nature of such a 


field. 
Each algebraic number 6, whether an integer or not, evidently satisfies 
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an infinity of different equations with rational coefficients, that is, there 
is an infinity of polynomial functions F(t) of one variable ¢ which vanish 
for £ = 9 and whose coefficients are rational. However, among all these 
functions F(t) there is necessarily one f(t) whose degree n is as small 
as possible, and it follows immediately from the well-known method of 
polynomial division that each of the functions F(t) will be algebraically 
divisible by this function f(t), and that f(t) will not be divisible by any 
polynomial of lower degree and rational coefficients. For this reason, the 
function f(t) and also the equation f(0) = 0 are called irreducible, and 
it is clear at the same time that the n numbers 1, 61,6?,...,0"—! form 
an irreducible system (§4,1). 
We now consider the set 2 of all numbers w of the form $(@), where 


$(t) = x0 + it + rot? +--+ + anit?) 


is any polynomial in ¢ with rational coefficients 19,21, £2,...,2n—1 and 
degree < n. We first remark that each such number w = (0) is uniquely 
expressible in this form, by virtue of the irreducibility of f(t). We then 
see easily that the numbers w are closed under rational operations, that 
is, addition, subtraction, multiplication and division. For the first two 
operations this evidently follows from the common form (6) of all the 
numbers w, and for multiplication it suffices to remark that each number 
of the form ~(@), where ~(t) is a polynomial of any degree with rational 
coefficients, is likewise a number w, because if we divide w(t) by f(t) 
the remainder will be a function ¢(¢) of the kind above, and at the same 
time we have ~(@) = $(@). Finally, to treat the case of division it is 
enough to show that if w = ¢(0) is nonzero then its reciprocal w—1 also 
belongs to the system 2. Well, since ¢(¢) has no common divisor with the 
irreducible function f(t), the method for finding the greatest common 
divisor of the polynomials f(t), o(t) gives, as we know, two polynomials 
fi(t), é1(t), with rational coefficients, satisfying the identity 


f(t) fit) + o(t)¢i(t) = 1. 


When t = @ this gives the result claimed. 

I call a system A of numbers a (not all zero) a field when the sum, 
difference, product and quotient of any two numbers in A also belongs 
to A. The simplest example of a field is the system of rational numbers. 
It is easy to see that this field is contained in any other field A since, if 
we choose any nonzero number a in A, it is necessary that the quotient 
1 of the numbers a and a belong to A, whence the claim follows, since 
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all the rational numbers are engendered from the number 1 by repeated 
additions, subtractions, multiplications and divisions. 

Our system 2 is a field, by the results just proved for the numbers 
w = ¢(@). The rational numbers come from ¢(@) by annulling all the 
coefficients x1, %2,..-,2%n—-1 which follow xp. A field Q obtained from 
an irreducible equation f(@) = 0 of degree n in the manner indicated is 
called a field of finite degree,t and the number n is called its degree. Such 
a field 2 includes n independent numbers, for example 1,0, 0?,...,0"7}, 
whereas any 2+ 1 numbers in the field evidently form a reducible system 
(84,1). The latter property, combined with the definition of field, can 
also serve as the definition of a field 2 of n“ degree. However, I shall 
not go into the proof of this assertion. 

Now if we arbitrarily choose n numbers 


wi = ¢1(6), we =¢2(8), .--, Wn =Gn(9) 


of the field 0, these numbers (by §4,2) form an irreducible system if 
and only if the determinant of their n? rational coefficients z is nonzero. 
In this case we call the system of n numbers w1,w,...,Wn a basis of 
the field Q. Then it is evident that each number w = ¢(6) is always 
expressible, uniquely, in the form 


w= hyo + howe +---+hyawn, 


with rational coefficients hi, he,...,hn. And conversely, all numbers w 
of this form are in 2. The rational coefficients h1,h2,...,hn will be 
called the coordinates of the number w with respect to this basis. 


§16. Conjugate fields 


We ordinarily understand substitution to be an act by which objects or el- 
ements being studied are replaced by corresponding objects or elements, 
and we say that the old elements are changed, by the substitution, into 
the new. Now let 2 be any field. By an isomorphismt of 22 we mean a 
substitution which changes each number 


Q, B, at p, a— p, af, a/ 


} If we understand a divisor of a field A to be any field B whose members all belong 
to A, then a field of finite degree can also be defined as a field with only a finite 
number of divisors. The word divisor (and the word multiple) is used here in a 
sense directly opposite to that used in speaking of modules and ideals, but this 
should not result in any confusion. 

} Dedekind calls it a permutation. (Translator’s note.) 
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of 2 into a corresponding number 
a’, B, (a+ By, (a-£)', (a), (a/B)' 
in such a way that the conditions 


(1) (a+ B) =a' + B, 

(2) (a8)! = a’ 8", 

are satisfied and the substitute numbers a’, @’,... are not all zero. We 
shall see that the set 2’ of the latter numbers forms a new field, and 
that the isomorphism also satisfies the following conditions: 


(3) (a— By =a’ - ft, 
(4) (a/B)! = o'/f'. 


Indeed, if we let a’, 6’ be any two numbers in the system 10’, then there 
will be two numbers a, (3 in the field Q changed respectively into a’, (’. 
But since the numbers a + (, af are likewise in 2, it follows from (1) 
and (2) that the numbers a’ + (’, a’@’ are in 2’. Thus the numbers of 
the system 1’ are closed under addition and multiplication. Moreover, 
since the numbers a = (a— 8)+/ and a—f are likewise in Q, it follows 
from (1) that 
a’ = (a — B)' +f", 

which is condition (3). Thus the numbers of the system 1’ are also 
closed under subtraction. Finally, if @’ is nonzero then, by virtue of 
(1), @ will also be nonzero and hence a/f is a number in the field 2. 
Since we now have a = (a/f)@, it follows from (2) that we also have 
a! = (a/B)'B', which is condition (4). Thus the numbers of the system 
are also closed under division, and consequently 12’ is a field. Q.E.D. 


We also note that if 6’ = 0 then so too is 8 = 0; otherwise every 
number a in the field 9 could be put in the form (a/()(, with the result 
that a’ = (a/)'B’ = 0, whereas we have agreed that the numbers a’ 
in the system 91’ are not all zero. It evidently follows from this, in view 
of (3), that an isomorphism changes two different numbers a, 7 in the 
field 2 into two different numbers a’, (’ in the field 2’, so that each 
number a’ in the field ’ corresponds to a single number a in the field 
Q. The correspondence can therefore be reversed in a unique manner, 
and the substitution that changes each number a’ in the field 2’ into 
the corresponding number a in the field 2 will be an isomorphism of 
the field Q’, because it satisfies the characteristic conditions (1) and (2). 
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Each of the two isomorphisms will be called the inverse of the other. We 
also call Q and 2’ conjugate fields, and any two corresponding numbers 
a, a! will be called conjugate numbers. For each field 2 there is evidently 
an identity isomorphism of 2, which replaces each number in 2 by itself. 
Thus every field is conjugate to itself. It is also easy to see that two 
fields conjugate to a third are conjugate to each other. Because if each 
number a in a field 2 is changed by an isomorphism P into a number 
a’ in a field 9’, and a’ is likewise changed by an isomorphism P’ into a 
number a” in a field 2”, then it is clear that the substitution changing 
each a in 2 into the corresponding a” in 1” is likewise an isomorphism 
of the field 2, and we denote it by PP’. If we let P-1 denote the inverse 
isomorphism of P, then PP-! will be the identity isomorphism of Q, 
and 2” will be changed into Q by the isomorphism 


(PP’)7! = Pp, 


We have previously remarked that each field includes all the rational 
numbers, and it is easy to show that each of these is changed into itself 
by an isomorphism of the field. Because, if we take a = @ in (4) we get 
1’ = 1, and then, since each rational number can be engendered from 1 
by a series of rational operations, our proposition follows immediately 
from properties (1), (2), (3), (4). 

Now let 6 be any number in the field Q and let R(t) be any rational 
function of t with rational coefficients. The number w = R(6), in the 
case where the denominator of R(t) does not vanish for t = 6, will also 
be in 2, and if @ is changed into the number 6’ by an isomorphism of the 
field, then the number w will be changed into the number w’ = R(6’), 
since it is formed from the number @, and the rational coefficients of R(t), 
by rational operations. It follows immediately that if 0 is an algebraic 
number, and hence satisfies an equation of the form 0 = F(6) with 
rational coefficients, we must also have 0 = F'(6’). Thus each number 
6’ conjugate to an algebraic number @ is likewise an algebraic number, 
and if @ is an algebraic integer, 6’ will also be an algebraic integer. 

After these general considerations, which apply to all fields, we return 
to our example of a field 2 of finite degree n, and pose the problem of 
finding all the isomorphisms of 9. Since all numbers w in such a field 
are, by §15, of the form ¢(6) where @ is a root of an irreducible n** 
degree equation 0 = f(6), it follows from the preceding results that an 
isomorphism of 2 will be completely determined by the choice of a root 
6’ of the equation 0 = f(6’), because when 6 is changed into 0’, w = (0) 
is changed at the same time into w’ = ¢(@’). Conversely, if we choose 
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0’ to be any root of the equation 0 = f(6’), and replace each number 
w = (0) in the field 2 by the corresponding number uw’ = ¢(6’), then 
this substitution will really be an isomorphism of 2, that is, it will satisfy 
conditions (1) and (2). To show this, we let $1(£), ¢2(t),... denote any 
particular functions of the form ¢(t). Now if we have 


a= 1(9), B= 2(9), a+ B as $3(9), af = pa(8), 

and consequently 
a! =9$1(0'), 8’ =$2(8), (a+ By =93(6'), (a8)' = $4(0'), 
it follows from the equations 
$3(6) = b1(9) + $2(8), (8) = $1(9)b2(4) 

and the irreducibility of the function f(t) that we have identically 

s(t) = di(t) + got), a(t) = dr(t)ba(t) + b5(2) F(@). 
Taking t = 6’, this gives the equations (1) and (2) we have to show. If 
we now put 

f(t) = (t- A)(E—8")--- (t- 9) 


then the n roots 6’,6”,...,0°™) will be different, since the irreducible 
polynomial f(t) cannot have a common divisor with its derivative f’(t), 
and each of them will correspond to an isomorphism P’, P”,..., P™ of 
the field 9. The isomorphism P‘") changes each number w = ¢(0) of 
the field © into the conjugate number w‘") = (0‘")) of the conjugate 
field 2). To avoid misunderstanding, we point out that the n conju- 
gate fields 2), although derived from Q by n different isomorphisms, 
nevertheless may not all be different. If they are all the same, 2 will 
be called a Galois or normal field.{ The algebraic principles of Galois 
amount to reducing the study of arbitrary fields of finite degree to the 
study of normal fields; however, lack of space prevents me from going 
further into this subject. 


§17. Norms and discriminants 
The norm N(w) of a number w in a field 2 of degree n is the product 
(1) N(w) = ww" ---w™ 


{+ Nowadays called a Galois or normal extension (of the rationals); however, it would 
distort Dedekind’s meaning to use the term “extension” here. (Translator’s note.) 
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of the n conjugate numbers w’, w”,...,w°") of w under the isomorphisms 
P',P”,...,P™. It does not vanish unless w = 0. If w is a rational 
number, then all the n numbers w") equal w, and hence the norm of a 
rational number is its n** power. If a, @ are any two numbers in the 
field 2 we have (a3) = a‘) 8) and consequently 


(2) N(aB) = N(a)N(6). 

The discriminant A(a1, Q2,...,Qn) of any n numbers a1, Q2,...,@n in 
the field Q is the square 

(3) A(a1, 02,.-.,4n) = (Sotalol! --- af”)? 


of the determinant formed by the n? numbers al”, By virtue of a 
well-known proposition in the theory of determinants we then have the 
relation 


(4) A(1,0,67,...,0°71) = (-1)" VPN(F'()), 


and since f’(@) cannot be zero, by the irreducibility of the polynomial 
f(t), it follows that the discriminant (4) has a nonzero value. 

Now if the n numbers w1,w2,...,wp form a basis of the field Q (§15), 
and if 


wW=hyw + howe +--+ hnwn 


is any number in the field then, since the coordinates hj, he,..., hy are 
rational numbers, the isomorphism P‘") will change w into the number 


w) = byw”) + hows? +--+ haw, 
from which we conclude that 
(5) A(ay, Q2,...,Qn) = a?7A(w1,we,...,Wn), 


where a is the determinant of the n? coordinates of the n numbers 
Q1,Q2,--.,Q,. This implies, first, that the discriminant of the basis 
W1,W9,...,Wpy, cannot vanish, otherwise every discriminant would vanish, 
contrary to the fact from above that A(1,0,07,...,8"—1) is nonzero. At 
the same time it follows that A(a,a2,...,Q,) will vanish if and only 
if the numbers a1, @2,...,Q@n, are dependent on each other (§4,2), and 
hence do not form a basis of 2. 

Since the numbers in a field are closed under multiplication, for any 
number p in 2 we can put 
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fy = M1,1W1 + Me,1W2 +--+ + Mn Wn, 
pw, = M1,2W1 + M2,2W2 +--+ Mn eWn, 


(6) 


Lin = My nW1 + MenW2 +++- + MnnWn, 
where the n? coordinates m;,;; are rational numbers. Applying the n 
isomorphisms P‘) yields n? new numbers of the form 


yOu” = maw” + mows” + ile + Mn, gw”? 


and, since their determinant is 
N(p) S twill. wi = > $m1,1M2,2°°*Mn,n S- tui wll wl), 
we conclude that 
(7) Nw = +71,1M2,2°+*Mn,n, 
because the determinant 


So tw wg wl = /A(w1,we,.--,Wn) 


is nonzero. 

It follows that every norm is a rational number and, by virtue of (4) 
and (5), so is every discriminant. These two propositions could also have 
been deduced from the theory of transformations of symmetric functions, 
but I wish to avoid relying on this. 

If we replace the jz in the equations (6) by 4—z, where z is any rational 
number, then the coordinates m;,, are unchanged except for the m;,; on 
the diagonal, each of which is replaced by m,,; — z. Theorem 7 is then 
changed into the equation 


m11-—2 ™2,1 cee Mn 
™1,2 12,2 -—Z% «. Mn,2 
‘ = (pu! - 2)(u" — z)---(u — 2), 
Min Man ve Mann — 2 
which, being valid for every rational value of z, is necessarily an identity 
in z. At the same time we see that the n numbers p’, p”,..., u(™ con- 


jugate to the number yp are the set of roots of an n“ degree equation 
whose coefficients are rational numbers. 


§18. The integers in a field Q of finite degree 


After these preliminaries, we now pass on to our main objective, the 
study of the integers in the field 2 of degree n, the set of which we 
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denote by o. Since the sum, difference and product of any two integers 
are also integers (by §13,1), and in 2 (by §15), the domain 0, includ- 
ing all the rational integers, is closed under addition, subtraction and 
multiplication. However, the first thing is to put all these numbers in a 
common, simple form. The following considerations lead us to it. 

Since each algebraic number w is a root of an equation of the form 


cw” + cw) 4 +++ + Cm—1W + Cm = 0, 


whose coefficients c,¢1,...,€m—1,Cm are rational integers, it follows, 
multiplying by c”—!, that each such number yields an integer cw when 
multiplied by the nonzero rational integer c. Now if the m numbers 


W1,W2,...,Wy, form a basis of the field 2, we can take nonzero rational 
numbers a1, 4@2,...,@n, so that the n numbers 
1 = 41W1, A2=A2W2, ..., On =OAnWn 


become integers, evidently forming a basis of 2, since they are indepen- 
dent (by §4,2). It follows that their discriminant A(a1,a2,...,Q,) will 
be a rational number (by §17), and in fact a nonzero integer since, by 
definition, it is formed from the integers a) by addition, subtraction 
and multiplication. Moreover, we obtain all numbers w in the field Q by 
letting the coefficients 11, 22,...,£%n in the expression 


W= 2710, + 22Q2+-++ + Fn 


run through all rational values. If we restrict their values to be rational 
integers then we certainly obtain only integers w (§13,1). However, it is 
likely that not all integers in the field 2 will be represented in this way. 
The situation relates to the following very important theorem: 


If there is an integer 8 of the form 
kyay + kgag +... +kpan 
p= en 
where k, ky, ko,...,kn are rational integers without common divisor, then 
there is a basis of the field consisting of n integers G1, B2,..-,; Bn which 
satisfy the condition 


A(a1,02,...,0n) = k°A(B1, B2,---,Bn)- 


Proof. Since 2,01, @2,..., Qn, are integers, they form a basis of a module 
b = [G, a1, @2,...,Qn] which includes only integers from the field 9. But 
since only n of these n + 1 numbers are independent there are (§4,5) 
n independent numbers (1, G2,..., 8, which form a basis of the same 
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module b = [@1, G2,..., 8p] and which are therefore integers in the field. 
We then have n + 1 equations of the form 


B=c1B1 + C282 +---+enBn, 
O11 = €1,181 + €2,182 +°*+ +n Bn; 
a2 = 1,281 + C2282 +++++en28n, 


whose n(n + 1) coefficients are rational integers and whose n+ 1 partial 
n x n determinants, obtained by suppressing one horizontal line, have 
no common divisor (§4,6). If we put 


S £01,1€2,2°' Cnn = ¢, 
then we have (§17,(5)) 
A(ay, (60 eran) Qn) = 2 A(B1, Be, seis Bn). 


Substituting the preceding expressions for G,a1,Q2,..., Qn, in the equa- 
tion kG = kya, + kpag +...+kpa, and observing that (;, f2,..-, Bn 
are independent, we find that 


key = kyea + keeig + +++ + kneijn, 
keg = kyc2,1 + kec2,2 + +++ + knean, 


Ken = kien + kecn2 + +++ + knenn- 
If we now replace the elements 
Cir, Car, +--+) Cnr 
in the r*" row of the determinant c by the respective elements 
C1, €2, eee) Cry 


we conclude from the preceding equations, using a well-known theorem, 
that the partial determinant obtained has value che Thus then +1 
quantities 


ck ck, _~—cka ckn 
are rational integers with no common divisor, and since the same is 
true of the n+ 1 numbers k, ki, ko,..., km, we necessarily have c = +k. 


QED. 


If k > 1, so that the integer ( is not in the module a = [a1, a2,..., Qn}, 
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then there is a basis of the field consisting of n integers (1, G2,..., Bp with 
discriminant A((@1, G2,...,Bn) of absolute value < A(a1,Q2,...,Qn). 
Now since the discriminant of every integer basis of the field 2 is a ratio- 
nal nonzero integer, as shown above, there must be a basis w, we,...,Wn 
whose discriminant 


A(wi, we, Her , Wn) =D 
has minimum absolute value. And it follows immediately from the pre- 
ceding results that, relative to such a basis, each integer 


w= hywy + howe +--+ + hnwn 


in the field 2 must have integer coordinates hy, he,..., hn. Moreover, an 
integer w is not divisible by a rational integer k unless all its coordinates 
are divisible by k. Conversely, since each system of integer coordinates 


hy, ha,...,hn produces an integer w, the set o of all integers in the field 
Q is identical with the finitely generated module [w1,w2,...,Wn] whose 
basis consists of the n independent integers w1,we,...,Wn.- 


The discriminant D of such a basis is a fundamentally important in- 
variant of the field 2. For this reason we call it the fundamental number 
or the discriminant of the field 2, and represent it by A(Q). In the 
singular case n = 1, 2 is the field of rational numbers, and we take its 
discriminant to be the number +1. To clarify the general case we again 
consider n = 2, that is, the case of a quadratic field. 

Each root @ of an irreducible quadratic equation is of the form 


6=a+bvd, 


where d is a nonsquare rational integer which is also not divisible by 
a square (except 1). The numbers a, b are rational, and 6 is nonzero. 
The set of all the numbers $(0) in the corresponding quadratic field is 
evidently the set of all numbers of the form 


w=t+uvd, 


where t, u run through all rational values. Under the nonidentity iso- 
morphism of the field, Vd changes into —/d, and hence w changes into 
the conjugate number 


w' =t—uvd, 


which is also in 2. Thus 2 is a normal field (§16). To investigate the 
integers w we put 
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where 2, y, 2 are rational integers with no common divisor and z can be 
assumed positive. Now if w is an integer, so is w’ (§16) and consequently 

w+! = a ww" = ia ay 

z Zz 

must also be integers. Conversely, if the latter are integers, then so is w 
(§13). Suppose that e is the greatest common divisor of z and z. It is 
necessary that e? divide x? — dy”, hence also dy” and finally y?, since d 
is not divisible by any square other than 1. Then e must also divide y, 
and hence be equal to 1, since z, y, z have no common divisor. Thus z 
is prime to x while z divides 2x, which means that z = 1 or z = 2. In 
the first case, w = « + yVd is certainly an integer. In the second case x 
is odd, so x? = 1 (mod 4), and since we must have xz? = dy* (mod 4) it 
is necessary that y also be odd, and therefore we have d = 1 (mod 4). If 
the latter condition is not satisfied, that is, if d = 2 or d = 3 (mod 4), 
then z must equal 1 and so we have o = [1, Vd] and 
2 


1 vad 
D=|i Vy = Ad. 
But if we have d= 1 (mod 4), z can also equal 2, and we have 
2 
1+vd 1 Lid 
o= [i 5 | and D= i =v =d. 


These two cases can be combined into one by noting that o = [1, D+vD) 
in each. It is also clear that a quadratic field is completely determined 
by its discriminant D. This is not so for the next case, namely n = 3, 
where invariants additional to the discriminant are necessary for the 
complete determination of a cubic field. However, we shall be able to 
give a complete explanation of this fact only with the help of the theory 
of ideals. 

We now return to considering an arbitrary field 2 of degree n, and 
we add the following remarks on divisibility and congruence of numbers 
in the domain 6. Let A, u be two such numbers, and suppose that A 
is divisible by 4. By the general definition of divisibility (§14) we then 
have \ = pw, where w is an integer, and since the quotient w of the two 
numbers , p belongs to 2, by definition of a field, w will likewise be a 
number in the domain o. The system m of all numbers in the field 2 


f It follows, for example, in the case d = —3 that the integers of the field are not all 
of the form x + y/—3 where x, y are rational integers. 
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divisible by yz therefore consists of all numbers of the form pw, as w runs 
through all numbers of the domain 0 = |w1,w2,..., Wn], that is, through 
all numbers of the form 


Ww = hyw + howe +--- +hnawn, 


whose coordinates hi, h2,..., hy are rational integers. Consequently we 
have m = [w1, [we2,..., Wn]. We now say that two integers a, @ in the 
domain o are congruent modulo jz, and write 


a= (mod p), 


when the difference a — £ is divisible by y and hence in m. Thus this 
congruence is completely equivalent to the following: 


o: = B (mod m), 


the meaning of which was explained in §2. In the contrary case, a, 3 
are called incongruent modulo y. If we understand a class modulo pz to 
be the set of all numbers in o congruent to a particular number, and 
hence congruent to each other, then, in the notation of §2, the number 
of these classes will be (0,m). And since the integers p11, pwe,..., Wn, 
forming the basis of m are connected to the numbers w,w2,...,wW, by 
n equations of the form (6), (§17), in which the coefficients mj, are 
necessarily rational integers, it follows from the equation following (7), 
together with Theorem 4 of §4, that the number of these classes is 


(o,m) = £N(u). 


The system m is identical with o if and only if yz is a unit, in which case 
+N(p) = (0,0) = 1. 

In this conception of congruence, where an actual number yu appears 
as divisor or modulus, there is a complete analogy with the theory of 
rational numbers. However, it is plain, as we have already indicated 
in detail in the Introduction and in Chapter 2, that completely new 
phenomena concerned with the decomposition of numbers into factors 
arise in this same domain o. These phenomena are brought back under 
the rule of simple laws by the theory of ideals, the elements of which will 
be covered in the next chapter. 


4 
Elements of the theory of ideals 


In this chapter we shall develop the theory of ideals to the point indicated 
in the Introduction, that is, we shall prove the fundamental laws which 
apply to all fields of finite degree, and which regulate and explain the 
phenomena of divisibility in the domain o of all integers in such a field 
Q. The latter are what we refer to when we speak of “numbers” in 
what follows, unless the contrary is expressly indicated. The theory is 
founded on the notion of ideal, whose origin has been mentioned in the 
Introduction, and whose importance has been sufficiently illuminated by 
the example in Chapter 2 (§§11 and 12). The exposition of the theory 
that follows coincides with the one I have given in the second edition of 
Dirichlet’s Vorlesungen tiber Zahlentheorie (§163). It differs mainly in 
external form; however, if the theory has not been shortened it has at 
least been simplified a little. In particular, the principal difficulty to be 
surmounted is now thrown more clearly into relief. 


§19. Ideals and their divisibility 


As in the previous chapter, let 9 be a field of finite degree n, and let o 
be the domain of integers w in 2. An ideal of this domain o is a system 
a of numbers a@ in o with the following two properties: 

I. The sum and difference of any two numbers in a also belong to a, 
that is, a is a module. 

II. The product aw of any number a in a with a number w in o isa 
number in a. 

We begin by mentioning an important special case of the concept of 
ideal. Let yu be a particular number; then the system a of all numbers 
@ = pw divisible by yz: forms an ideal. We call such an ideal a principal 
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ideal and denote it by o(u), or more simply by op or jo. It is evident 
that this ideal will be unchanged when y is replaced by an associate, 
that is, a number of the form ey, where € is a unit. If y is itself a unit we 
have oy = 0, since all numbers in o are divisible by py. It is easy to see 
that no other ideal can contain a unit. Because if the unit ¢ is in the ideal 
a then (by ITI) all products ew, and hence all numbers w in the principal 
ideal 0, are in a. But since, by definition, all numbers in the ideal a are 
likewise in 0, we have a = o. The ideal o plays the same role among the 
ideals as the number 1 plays among the rational integers. The notion 
of principal ideal o also includes the singular case where p = 0, where 
the resulting ideal consists of the single number zero. However, we shal] 
exclude this case from now on. 

In the case n = 1, where our theory becomes the old theory of num- 
bers, every ideal is evidently a principal ideal, that is, a module of the 
form [m] where m is a rational integer (§§1 and 5). The same is true 
for the special quadratic fields considered in Chapter 2 (§6 and the be- 
ginning of §7). In all these cases, where every ideal of the field 2 is 
a principal ideal, numbers are governed by the same laws that govern 
the theory of rational integers, because every indecomposable number 
also has the character of a prime number (see the Introduction and §7). 
This will follow easily from the results below, but I mention it now to 
encourage the reader to make continual comparisons with the special 
cases, and especially with the old theory of rational numbers, because 
without doubt it will help greatly in understanding our general theory. 

Since each ideal is a module (by virtue of I), we immediately carry 
over to ideals the notion of divisibility of modules (§1). We say that an 
ideal m is divisible by an ideal a, or that it is a multiple of a, when all 
the numbers in m are also in a. At the same time we say that a is a 
divisor of m. According to this definition, each ideal is divisible by the 
ideal o. If a is a number in the ideal a then the principal ideal oa will 
be divisible by a (by II). For this reason we say that the number a, and 
hence every number in a, is divisible by the ideal a. 

Likewise we say that an ideal a is divisible by the number n when a 
is divisible by the principal ideal 07. When all the numbers a in an 
ideal a are of the form np it is easy to see that the system t of all the 
numbers p = a/n will form an ideal. Conversely, if p runs through all 
the numbers in an ideal t while 7 is a fixed nonzero number, then all the 
products np will again form an ideal, divisible by 07. We shall denote 
an ideal formed in this way from an ideal t and number 7 by t7 or nt. 
We evidently have (t7)n’ = t(nn’) = (en’)n, and nr’ will be divisible by 
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ne if and only if v’ is divisible by t. Thus the equation yt’ = ne entails 
the equation t’ = t. The notion of a principal ideal oy is the special case 
of ty where ¢ = 0. 

We finally remark that divisibility of the principal ideal oy by the 
principal ideal on is completely equivalent to divisibility of the number 
p by the number n. The laws of divisibility of numbers in o are therefore 
included in the laws of divisibility of ideals. 

The least common multiple m and the greatest common divisor 0 of 
two ideals a, 6 are also ideals. Certainly m and 0 are modules (§1, 3 
and 4), and they are divisible by 0, since a and 6 are divisible by o. 
Also, if 4 = a = £ is a number in m and hence also in a and 5, and if 
6 = a' +f’ is a number in the module 0 then the product pw = aw = Bw 
will likewise be in m and the product dw = a’w + (’w will be in 0 since 
(by virtue of II) the products aw, aw are in a and the products Bw, Bw 
are in b. Thus m and 0 enjoy all the properties of ideals. At the same 
time it is clear that mn will be the least common multiple of the ideals 
an, bn, and 07 their greatest common divisor. 

If 6 is a principal ideal on, then the least common multiple m of a, b 
will always be of the form nt, where t is another ideal and in fact a divisor 
of a, since 7a is a common multiple of a and o7, and hence divisible by 
net. This case occurs frequently in what follows, and for that reason we 
say, for brevity, that the ideal t dividing the ideal a corresponds to the 
number 7. If t’ is the divisor of t corresponding to the number 7’, then 
vt’ will also be the divisor of a corresponding to the product 77’. This 
is because 77’t’ is the least common multiple of nt and onn’, and hence 
also of a and ony’, since nv is the least common multiple of a and on, 
and o7n’' is divisible by on. 


§20. Norms 


Since each ideal a is also a module, we say that two numbers w, w’ in 
the domain o are congruent or incongruent modulo a according as their 
difference w — w’ belongs to a or not. We express the congruence of w 
and w’ modulo a (§2) by the notation 


w =w’ (mod a). 


As well as the theorems on congruences established previously for ar- 
bitrary modules, we must also note that two congruences modulo the 
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same ideal a, 


wea, w’=w” (mod a), 


can be multiplied to give the congruence 
ww” = w'w"” (mod a), 


since the products (w—w’)w” and (w” —w’")w’, and hence also their sum 
ww” — w'w", are numbers in the ideal a. Moreover, if m is a principal 
ideal oy, then (by virtue of §18) the congruence w = w’ (mod m) will be 
identical with the congruence w = w’ (mod yp). 

A very important consideration is the number of different classes, mod- 
ulo the ideal a, which make up the domain o. If ps is a particular nonzero 
number in the ideal a then the principal ideal oy will be divisible by a, 
and since a is divisible by 0 it follows (§2,4) that 


(0, of) = (0,a)(a, op). 


But the number (0, oj4) = +N(u) by §18, and hence the domain o con- 
tains only a finite number of mutually incongruent numbers modulo the 
ideal a (§2,2). This number (0, a) will be called the norm of the ideal a 
and we denote it by N(a). The norm of the principal ideal oy is equal 
to +N(,), and o is evidently the only ideal with norm 1. 

If p runs through a complete system of N(a) incongruent numbers 
(mod a) then so does (1+), and adding the corresponding congruences 
1+p=/(’, where p’ runs through the same values as p, yields N(a) = 0 
(mod a). That is, N(a) is always divisible by a. As a special case, this 
result includes the evident theorem that N (1) is divisible by ys (see §17). 

Now suppose that t is any ideal and 7 is a nonzero number. Always, 


(on, t) = (0, t) = N(t), 


since two numbers nw’ and nw” in the principal ideal no are congruent 
(mod nr) if and only if the numbers w’, w” in o are congruent (mod rt). 

Let a, b be any two ideals, let m be their least common multiple, and 
let 0 be their greatest common divisor. By §2, 3 and 4, we have 


(6, a) = (6, m) = (2, a) 
and, since 0 is divisible by 0, 
(o,a) = (0,d)(2,a), (0,m) = (0, b)(b, m), 


hence 
N(a) = (6,a)N(0), N(m) = (6, a)N(6) 
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and 

N(m)N(0) = N(a)N(6). 
If we apply these theorems to the case where 6 is a principal ideal on, 
so that m is of the form ty, then, since the ideal r is the divisor of a 
corresponding to the number 7 (§19), we get 


(b, a) = (on, tn) = N(v), 


and consequently 
N(a) = N(t)N(Q). 


The ideal t can also be defined as the system of all roots p of the con- 
gruence np = 0 (mod a), as is easy to see. 


§21. Prime ideals 


An ideal p is called prime when it is different from o and divisible by no 
ideals except o and p. This definition yields the following theorems: 


1. Each ideal a different from o is divisible by a prime ideal. 


This is because the ideals different from o that divide the ideal a 
include one, p, whose norm is smallest, and the latter is certainly a 
prime ideal. Indeed, if 0 is an ideal dividing p, but different from p and 
o, then we have (0,p) > 1. Hence N(p) = (0,p)N(0) > N(d) and 0 
will be a divisor of the ideal a, different from o and with norm < N(p), 
contrary to hypothesis. Thus p is a prime ideal. Q.E.D. 


2. If the number 7 is not divisible by the prime ideal p, then np will 
be the least common multiple of the two ideals p and 07. 


The least common multiple of p and o7 is in any case of the form nr, 
with the ideal t a divisor of ap and hence equal to o or p. But t cannot 
be o, because 7 is not divisible by p; consequently rt = p. Q.E.D. 


3. If neither of the two numbers 7, p is divisible by the prime ideal p, 
then their product np will not be divisible by p. 


Otherwise the ideal n(0p) would be a common multiple of p and o7. 
Consequently it would be divisible by the least common multiple np of p 
and on. But the divisibility of n(op) by np implies (§19) the divisibility 
of op by p, contrary to hypothesis. Thus 7p is not divisible by p. Q.E.D. 
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It follows immediately from this that all the rational numbers divisible 
by a prime ideal p, amongst which we have the number N(p) (§20), form 
a module [p], where p is a positive rational prime number. This is because 
the smallest positive rational p divisible by p cannot be a composite 
number ab, otherwise one of the smaller numbers a, b would likewise be 
divisible by p, and p cannot be 1 otherwise we should have p = o (§19). 
And each rational integer m divisible by p must be divisible by p, as 
becomes evident by putting m in the form pq +r, since the remainder 
r = m-—~pgq is also divisible by p. Now with op being divisible by p, and 
hence N(op) = p” being divisible by N(p) (§20), N(p) will be a power 
pf of p, and the exponent f will be called the degree of the prime ideal 
p. 

4. If the ideal a is divisible by the prime ideal p then there is a number 
7 such that np is the least common multiple of a and on. 


This important theorem is evident when we have a = p, because any 
number 77 not divisible by p, for example 7 = 1, satisfies the condition. 
But if a is different from p we first confine ourselves to showing the exis- 
tence of a number 7 such that the divisor t of the ideal a corresponding 
to 7 is at the same time divisible by p, while having norm /ess than that 
of a. Since we have N(a) = N(t)N(0), where 0 is the greatest common 
divisor of a and o7 (§20), the latter condition depends on choosing 7 so 
that N(0) > 1, which means that 0 is different from o. To attain this 
goal, and at the same time make t divisible by p, we distinguish two 
cases: 

First, if all the ideals (except 0) that divide a are divisible by p, then 
we choose 7) to be a number divisible by p but not divisible by a, which 
is always possible because p is not divisible by a. Then it is clear that 
0 will be divisible by p, and hence will be different from o. Moreover, 
since 7 is not divisible by a, and nt is, t will also be different from 0, 
and hence divisible by p. 

Second, if there is an ideal e different from o which divides a and is 
not divisible by p, then we choose 7 to be a number divisible by e but 
not divisible by p. Then 0 will be divisible by e and hence again different 
from 0. Moreover, since nt is divisible by a and hence also by p, t will 
also be divisible by p, because 7 is not divisible by p (by 1). 

Having established the existence in both cases of at least one number 
7 with the required property, we see without difficulty that we certainly 
have t = p if we choose 7 so that N(t) is as small as possible. Because, 
if the ideal t divisible by p is not equal to p, we can proceed with r the 
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same way as with a, and choose a number 7’ in such a way that the 
divisor v’ of t corresponding to 7’ again has norm less than that of t, 
while likewise being divisible by p. But since (§19) t’ is at the same time 
the divisor of a corresponding to the number 77’ this contradicts the 
assumption we have made about 7 and t. Thus t = p, that is, np is the 
least common multiple of a and on. Q.E.D. 


§22. Multiplication of ideals 


If a runs through all the numbers in an ideal a, and @ runs through 
those of an ideal 6, then all the products of the form a, together with 
their sums, form an ideal c. These numbers are in o and they are closed 
under addition; also under subtraction, because the numbers (—a) are 
likewise in a. Finally, each product of a number La in ¢ by a number 
w in o also belongs to ¢, since each product aw again belongs to a. This 
ideal ¢ is called the product of the two factors a, 6, and we denote it by 
ab. 

It follows immediately from this definition that oa = a, ab = ba and, 
if c is any third ideal, (ab)c = a(6c), whence we conclude by well-known 
arguments{ that in a product of any number of ideals aj, d2,...,Qm 
the order of the multiplications, which combine two ideals into a sin- 
gle product, has no influence on the final result, which can be written 
simply as a,q2-°--a, and evidently consists of all numbers of the form 
Na Q2...Am, where a1, Q2,...,@m are numbers from the respective fac- 
tors a1,02,...,4m. If all the m factors equal a their product will be 
called the m“ power of a, and we denote it by a”. We also put a° = 0, 
a! = a and in general we have a"a° = a"t®, (a")® = a"*. In addition, we 
evidently have a(o7) = an and (on)({o7’) = onn’. Finally, we re-establish 
the following theorems: 


1. The product ab is divisible by the factors a and b. 

This is because (by virtue of property IT) each product aw, hence each 
product a3, and hence (by I) each sum of such products belongs to a. 
That is, ab is divisible by a. 


2. If a is divisible by a’, and b by 6’, then ab is divisible by a’b’. 
This is because all the numbers Naf in ab are in a’b’, since a is ina 
and hence in a’, and is in b and hence in b’. 


f See §2 of Dirichlet’s Vorlesungen tiber Zahlentheorie. 
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3. If neither of the ideals a, 6 is divisible by the prime ideal p, then 
the product ab is also not divisible by p. 

This is because there are numbers a, (3, in a, b respectively, which are 
not divisible by p, and then the number a in ab is also not divisible by 


p (§21,3). 


§23. The difficulty in the theory 


It would be easy to augment considerably the number of theorems con- 
necting the notions of divisibility and multiplication of ideals, and we 
mention without proof the following propositions, simply to emphasize 
the resemblance with the corresponding propositions in the theory of 
rational numbers. 

If a, 6 are relatively prime ideals, that is, having greatest common 
divisor o, then their least common multiple is ab, and at the same time 


N(ab) = N(a)N(b). 


If p is a prime ideal, and a is any ideal, then either a is divisible by p 
or a and p are relatively prime. 

If a is an ideal relatively prime to b and c, then a is also relatively 
prime to be. 

If ab is divisible by c and a is relatively prime to c, then b is divisible 
by c. 

However, all these propositions are insufficient to complete the anal- 
ogy with the theory of rational numbers. It is necessary to remember 
that divisibility of an ideal ¢ by an ideal a, according to our definition 
(§19), means only that all the numbers in the ideal ¢ are also in the 
ideal a. It is very easy to see (§22,1) that any product of a by an ideal 
b is divisible by a, but it is by no means easy to show the converse, 
that each ideal divisible by a is the product of a by an ideal b. This 
difficulty, which is the greatest and really the only one presented by the 
theory, cannot be surmounted by the methods we have employed thus 
far, and it is necessary to examine more closely the reason for this phe- 
nomenon, because it is connected with a very important generalisation 
of the theory. By attentively considering the theory developed until now, 
one notices that all the definitions retain their meaning, and the proofs 
of all theorems still hold, when one no longer supposes that the domain 
o consists of all integers in the field 0. The only properties of o really 
needed are the following: 
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(a) The system 0 is a finitely generated module [w, we, ..., wp] whose 
basis is also a basis for the field 2. 

(b) The number 1 is in 0, hence so are all the rational integers. 

(c) Each product of two numbers in 0 is also in o. 

When a domain o enjoys these three properties we shall call it an 
order. It follows immediately from (a) and (c) that an order consists 
entirely of integers from the field 2, but it does not necessarily contain 
all the integers (except in the case n = 1). Now if a number a in the 
order o is called divisible by a second such number yu only when a = pw, 
with w also in o, and if we modify the notion of congruence of numbers in 
o in the same manner, then one sees immediately that the number (0, oj) 
of mutually incongruent numbers of o modulo p is again +N(y) (§18). 
It is also easy to see that all the definitions and all the theorems of the 
present chapter retain their meaning and truth if we always understand 
number to mean a number in the order 0. Thus, in particular, each order 
o in field 2 has its own theory of ideals, and this theory is the same for 
all orders (which are infinite in number) up to the point we have carried 
it so far. However, while the theory of ideals in the order o of all integers 
in the field 2 leads finally to general laws which coincide completely with 
the laws of divisibility for the rational numbers, the theory of ideals in 
other orders is subject to certain exceptions, or rather, it requires a 
certain limitation of the notion of ideal. The general theory of ideals in 
an arbitrary order, whose development is equally indispensable for the 
theory of numbers and which, in the case n = 2, coincides with the theory 
of orders of binary quadratic forms,} will be left aside in what follows,t 
and I shall content myself with giving an example to call attention to 
the character of the exceptions just mentioned. In the quadratic field 
resulting from a root 
_ -1+Vv-3 

2 


of the equation 6? + 8+ 1 = 0 the module [1, /—3] forms an order o not 
containing all the integers of the field. The modules [2, 1+/—3] = p and 
[2, 2\/—3] = 0(2) must be regarded as ideals in this order 0, inasmuch as 
they enjoy the properties I and II (§19). However, while 0(2) is divisible 
by p, there is no ideal q in o such that pq = 0(2). 


0 


{ Disquisitiones Arithmeticae, art. 226. 

ft I treat this theory in detail in the recently published memoir: “Ueber die An- 
zahl der Ideal-Classen in den verschiedenen Ordnungen eines endlichen Kérpers.” 
(Festschrift zur Sdcularfeier des Geburtstages von C.-F. Gauss. Braunschweig, 30 


April 1877). 
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§24. Auxiliary propositions 
To achieve the completion of the theory of ideals in an order o containing 
all the integers of a field 2 we need the following lemmas, which are not 
true unless we restrict ourselves to such a domain o. 


1. Let w, u, v be three nonzero numbers in o such that v is not 
divisible by u. Then the terms of the geometric progression 


2 3 
V V V 
W, W—-, wl—|}, wl-}, ..., 
Lb (Z) (Z) 


up to a term 


at some finite position, will all be in 0, and beyond that none of them 
will be an integer. 


In fact, if the number of integral terms exceeds the absolute value k 
of N(w) then there must be (by §18) at least two among them, corre- 
sponding to exponents s and r > s, which are congruent modulo w. But 


such a congruence, 
Vv s 
w{— (mod w), 
(7) 
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implies that the number 


ar 


in the field Q satisfies an r*? degree equation of the form 
i = n® + a 
where w”’ is an integer, and hence (§13,2) 7 itself is an integer, contrary 


to our hypothesis that v is not divisible by u. Thus at most k terms 
of the series can be integers, and hence members of o. Moreover, if the 


sea) 
7 ’ 


with r > 1, is an integer, and if s is any one of the r exponents 
0,1,2,...,7—1 then the term 


---(3) 


term 
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will also be an integer, because 


is an integer (§13,2). This completes the proof of the proposition. 


2. Let yu, v be two nonzero numbers in 0, with v not divisible by p. 
Then there are two nonzero numbers «, 4 in o such that 
K oY 


XB 


and «? is not divisible by 4. 


If 
7 e-1 y e 
A=p[- », Keel 
HG) 7 28g) 


are the last two integral terms of the series 


m aC) GY AGG) = 


and hence in o, then we evidently have e > 1 and 


KY ie y\ett 
Me OR Pe 


Thus «? is not divisible by 4. Q.E.D. 


§25. Laws of divisibility 
With the aid of these lemmas it is easy to bring the theory of ideals in 
the domain o to the desired conclusion, which is found in the following 
laws: 


1. If p is a prime ideal then there is a number \ divisible by p, and a 
number « not divisible by p, such that «p is the least common multiple 
of od and ox. 


Proof. Let ys be any nonzero number in the prime ideal p. Since ou 
is divisible by p there is a number v such that vp is the least common 
multiple of o and ov (§21,4). This number v cannot be divisible by p, 
otherwise the least common multiple of o4 and ov would be ov and not 
py. Now if we choose (§24,2) the two numbers «, » so that Ku = AV 
and x? is not divisible by A, then (§19) the ideal xvp will be the least 
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common multiple of «(o4) = oAv and ox«v. It follows (§19) that Kp is 
the least common multiple of oA and ox. Then p is the divisor of the 
principal ideal 0 corresponding to the number «x. However, x is not 
divisible by p, otherwise x? would be divisible by xp and hence also by 
». 


2. Each prime ideal p has a multiple, by an ideal 0, which is a principal 
ideal. 


Proof. We retain « and from above with the same meaning, and 
let 0 be the greatest common divisor of 0A and ok. We shall show 
that pd? = o\. In fact, since all numbers in the ideal 0 are of the form 
6 = kw + Aw’, where w, w’ are two numbers from 0, if w is any number 
in p we have wé = kmw + Aww’ = 0 (mod 4), because kp and hence 
also Ko is divisible by oA. Thus pd is divisible by o\. Conversely, if « 
is not divisible by p, in which case o is the greatest common divisor of 
o« and p, we can express the number 1 in 0 as kw + @, with w in p and 
@ in p. We then have \ = \1- Kw +m-A= 0 (mod pd), because the 
first factors \, @ are in p and the second factors kw, \ are in 0. Thus 
each of the two ideals pd, oA is divisible by the other, and consequently 
pd = od. Q.E.D. 


3. If the ideal a is divisible by the prime ideal p then there is exactly 
one ideal a’ such that pa’ = a, and at the same time N(a’) < N(a). 


Proof. Suppose, as we now can, that pd = od. Then if a is divisible 
by p, and hence ad by pd (§22,2), we have ad = Aa’ where a’ is an ideal 
(§19). Multiplying by p, we get Aa = Apa’ and hence also a = pa’. Now 
let 6 be an ideal also satisfying the condition pb = a. Multiplying the 
equation pb = pa’ by 0 we get Ab = Aa’, whence b = a’. In addition, 
there is (§21,4) a number 77 such that 7p is the least common multiple of 
a and on. But with np being divisible by a = a’p it follows, multiplying 
by 0, that 07. is divisible by Aa’, and hence that 7 is divisible by a’. 
However, 77 is certainly not divisible by a, otherwise on and not np would 
be the least common multiple of a and on. Thus, since 7 is divisible by 
a’ and not by a, it follows that a’ is different from a, and consequently 
we have N(a’) < N(a), because a’ is a divisor of a. Q.E.D. 


4. Each ideal a different from o is either a prime ideal or else express- 
ible as a product of prime ideals, and in only one way. 


Proof. Since a is different from o there is (§21,1) a prime ideal p 
dividing a, and hence we can put (by 3) a = pia), where N(a,) < N(a). 
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If we have a; = o then a = p, will be a prime ideal. But if N(a,) is > 1, 
so that a, is different from o, we can likewise put a, = peag where po is 
a prime ideal and N(a2) < N(a,). If N(a2) > 1 we can continue in the 
same manner, obtaining ideals a1, a2,a3,..., whose norms are smaller 
and smaller, finally arriving at the ideal o = a,, after a finite number of 
decompositions. We then have a product of m prime ideals 


a= pip2:-' Pm. 
Now if at the same time 
@= q142°"'qm, 


where qi, 42,---;4m are also prime ideals, then q, will be a divisor of 
the product pip2--:pm and hence (§22,3) at least one of its factors, pi 
for example, must be divisible by q;. And since p, is divisible only by 
the two ideals o and p,, we necessarily have q, = pi, since q, is different 
from 0. We then have 


Pi(Pob3 --- Pm) = P1(q2q3--- dm), 


whence (by 3) 
P2P3 °° Pm = 4293°°" dm, 
We can continue in the same manner, just as in the theory of rational 


numbers,{ and thus arrive at the result that each prime ideal factor in 
one product occurs in the other, and exactly the same number of times. 


QED. 


5. Each ideal a, when multiplied by a suitable ideal m, becomes a 
principal ideal. 

Proof. Suppose that a = pip2---pm. By 2, we can multiply the prime 
ideals p1, p2,..., Pm by the corresponding ideals 0), 02,...,0., to change 
them into principal ideals p101, p202,...,Pm%m. If we now put 


m = 0102°--9m, 


then am = (p121)(p292) ---(Pm%m) will be a unique product of principal 
ideals, and hence itself a principal ideal. Q.E.D. 


6. If the ideal ¢ is divisible by the ideal a then there is an ideal 6, and 
only one, satisfying the condition ab = ¢. If the product ab is divisible 
by the product ab’ then 6 will be divisible by 6’, and ab = ab’ implies 
b= 0’. 


¢ See Dirichlet’s Vorlesungen tiber Zahlentheorie, §8. 
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Proof. We choose the ideal m so that am is a principal ideal ou. Now 
if ¢ is divisible by a, and hence cm by am (§22,2), we can, by §19, put 
cm = yb, where b is an ideal. Multiplying by a, we get uc = ab, whence 
c= ab. Now let a, 6, b’ be any ideals, and suppose that ab is divisible 
by ab’. Again it follows, multiplying by m (§22,2), that jb is divisible 
by yb’, and hence (§19) that 6 is divisible by 6’. If in addition ab = ab’, 
then each of the ideals b, b’ will be divisible by the other, that is, 6 = b’. 
Q.E.D. 


7. The norm of a product of ideals is equal to the product of the 
norms of the factors: N(ab) = N(a)N(6). 


Proof. We first consider the case of a product a = pa’, where the factor 
p is a prime ideal. Since a is divisible by p, there is (by 3) a number 7 
divisible by a’ but not by a, and np is the least common multiple of a 
and on. Then we have (§20) N(a) = N(p)N(0), where 0 is the greatest 
common divisor of the ideals a and 07. Since a and o7 are divisible by a’, 
2 must also be divisible by a’ (§1,4), and hence there is (by 6) an ideal n 
satisfying the condition na’ = 0. Moreover, since a is divisible by 2, pa’ 
is divisible by na’, so the prime ideal p must (by 6) be divisible by n, and 
thus we must have n = p or e. The first equation is impossible unless 
we have 0 = pa’ = a, so that 7 is divisible by a, and this is not the case. 
We therefore have n = 0, whence 0 = a’ and also N(pa’) = N(p)N(a’), 
which proves the theorem in the case considered. However, the general 
theorem is an immediate consequence. Since each ideal (other than 0) 
is (by 4) of the form 


a = pip2---Pm, 

where pj, f2,.-.-, Pm are prime ideals, it follows that 
N(a) = N(p1)N (pops ---Pm) = N(b1)N(p2)N(p3---Pm) = - °° 
and hence 
N(a) = N(p1)N(p2) +» N(Pm). 

Moreover, if we have 

b = qig2--- qr, 
where qi, q2,°'°:,q, again denote prime ideals, then we get 


ab = pip2-+- Pmdqid2°*-4r, 
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and consequently 


N(b) = N(qi)N(qg2)---N (ar); 
N(ab) = N(p1)--- N(Pm)N(q1)--- N(ar); 
which obviously implies 
N(ab) = N(a)N(6). 
Q.E.D. 


8. An ideal a (or a number a) is divisible by an ideal 0 (or a number 
6) if and only if each power of a prime ideal which divides 2 (or 6) also 
divides a (or a). 

Proof. If p is a prime ideal and p™ is a divisor of an ideal 3 then we 
have (by 6) 0 = 0;p™, where 0; is an ideal. If we suppose the latter 
decomposed into all its prime factors, then 0 will also be expressed as 
a product of prime ideals, among which the factor p appears at least m 
times. Conversely, if the decomposition of 0 into prime factors includes 
the prime ideal p at least m times, then 0 will evidently be divisible 
by p™. Thus if we suppose that every power of the prime ideal which 
divides 0 also divides an ideal a, this amounts to saying that all the 
prime factors in the decomposition of 0 appear, at least as often, as 
factors in the decomposition of a. The factors of a include first of all 
the factors of 0 and, if we denote the product of the other factors by 0’, 
then we have a = 00’, and consequently a is divisible by 0. The converse 
proposition, that if 0 is a divisor of a then each power of a prime ideal 
that divides 0 also divides a, is verified easily. Q.E.D. 


If we combine all factors of the same prime in the decomposition of 
an ideal a then we find 

a=p%q?r’.-., 

where p,q,t,... are different prime ideals. By virtue of the preceding 

theorems we can show that all the divisors 0 of a are given by the formula 


a= pq’ rc” teey 
where the exponents a’,b’,c’,... satisfy the conditions 
O<a'<a, 0<U <b, O0<e'<e, 


Since any two different combinations of exponents a’,b’,c’,... corre- 
spond (by 4) to different ideals 0, the total number of different divisors 
will be (a+ 1)(b+1)(c+1)---. 
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9. If 0 is the greatest common divisor of the two ideals a, b then we 
have 
a=da’, b=0db’, 


where a’, 6’ are relatively prime ideals, and the least common multiple 
m of a, b is da’b’ = ab’ = ba’. Moreover, if ae is divisible by 6, e will be 
divisible by 6’. 

We leave the task of finding the proof of this proposition to the reader, 
along with the rules that allow m, 0 to be derived from the decomposi- 
tions of a, 6 into prime factors. 


§26. Congruences 
Having established the laws of divisibility for ideals, and hence also for 
numbers in o, we shall add some considerations on congruences which are 
important for the theory of ideals. For the moment we shall be content 
simply to give indications of the proofs. 


1. Since o is the greatest common divisor of two relatively prime ideals 
a, 6, and ab is their least common multiple, then (§2,5) the system of 
two congruences 


w = p (mod a), w =o (mod B), 


where p, o are two given numbers in 0, always has roots w, and all these 
roots come under the form 


w =T (mod ab), 


where 7 represents a class of numbers modulo ab which is determined by 
p and o, or by their corresponding classes modulo a and 6b respectively. 
Conversely, each class 7 (mod ab) is determined in this way by precisely 
one combination p (mod a), o (mod 6). 

We now say that the number p is prime to the ideal a when op and a are 
relatively prime ideals, and we let (a) denote the number of mutually 
incongruent numbers modulo a which are prime to a. One easily derives 
the theorem that 


(ab) = p(a)v(b) 


for two relatively prime ideals a, 6 since 7 is prime to ab if and only if p 
is prime to a and o is prime to 6. Thus we only need to find the value of 


826. Congruences 135 


the function (a) in the case where a is a power p™ of the prime ideal 
p. The total number of mutually incongruent numbers modulo p™ is, in 
the case m > 0, equal to 


N(p™) = [N(p)]” = (0,p™) = (0, p)(p, p™) = (p, p™)N(p). 


It is necessary to subtract from this the number of numbers not prime 
to p™, and hence divisible by p. Since this number is equal to 


(p,p™) = [N(p)]"—?, 


we get 
He) = LN(@)I” — [NII = NO") (1 He) 


whence we conclude immediately, by virtue of the preceding theorem, 
that 


wa) = N@ TT (1- 375) 


where the product [] is taken over all the distinct prime ideals p that 
divide the ideal a. Since we also have 


Y(o) = 1, 
we alrive, just as in the theory of rational numbers,} at the theorem 
Yo ¥(a') = N(a), 
where the summation is taken over all the ideals a’ that divide a. 


2. If 0 is the greatest common divisor of the ideals a and o7 then we 
have a = da’, and ya’ will be the least common multiple of a and on, 
that is, a’ will be the divisor of a corresponding to the number 77 (§19). 
Conversely, if na’ is the least common multiple of a and o7n then we have 
a = 00’, where 0 is the greatest common divisor of a and o7. It is also 
clear that the complementary factors 0 and a’ of the ideal a remain the 
same for all numbers congruent to 7 modulo a. It is evident that it will 
also be the same if we replace 7 by a number 7’ = nw (mod a) where w 
is a number prime to a’. And conversely, if the greatest common divisor 
2 of a, 07 is also that of a, on’ then 


n =nw, n=n'w' (moda), 


t See Dirichlet’s Voriesungen tiber Zahlentheorie, §14. 
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whence we deduce 
nw’ =n (mod a), ww’ =1 (mod a’), 


and consequently w is a number prime to a’. Thus the number of mu- 
tually incongruent numbers 7 modulo a which correspond to the same 
divisor a’ of a is ~(a’). However, it is necessary to point out that here 
we have assumed the existence of at least one such number 7. Thus if 
we are given an arbitrary divisor a’ of the ideal a the most we can affirm 
so far is that the number x(a’) of mutually incongruent numbers 7 mod- 
ulo a which correspond to the same divisor a’ will equal either (a’) or 
zero. To decide this alternative we consider all the incongruent numbers 
modulo a, which are N(a) in number, and partition them into groups of 
x(a’) numbers, corresponding to the divisors a’. We must have 


d x(a’) = N(a) 


where the summation is taken over the divisors a’ of a. But since we 
also have (1) 


y_v(0') = N(a), 


it follows immediately that x(a’) is never zero and always y(a’). Thus 
we have proved the following very important theorem: 


“If 0 and a’ are any two ideals we can always, by multiplying 0 by an 
ideal b’ prime to a’, change it into a principal ideal 0b’ = on.” 


Putting da’ = a, the fact that w(a’) is nonzero means that there is 
always a number 7, corresponding to the divisor a’ of a, such that 0 will 
be the greatest common divisor of a and 07. If we then put o7 = 0b’, b’ 
will be an ideal prime to a’. Q.E.D. 


3. Since each product pp’ of numbers p, p’ prime to an ideal a is 
likewise a number prime to a, and since, as p remains constant and p’ 
varies, pp’ runs through a system of #(a) mutually incongruent numbers 
(mod a), we deduce by the well-known method,} and for each value of 
the number p, the congruence 


p* = 1 (mod a), 
which represents the highest generalisation of a celebrated theorem of 


{ See Dirichlet’s Vorlesungen tiber Zahlentheorie, §19. 
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Fermat. For a prime ideal p we conclude easily that every number w in 
the domain o satisfies the congruence 


w(*) = w (mod p), 


that is, the congruence 
w =y (mod p), 


where p is the rational prime number divisible by p and f is the degree 
of the prime ideal p (§21,3). This theorem is as important for the theory 
of the domain 0 as the theorem of Fermat is for the theory of rational 
numbers, as we shall at least try to make clear by the following remarks, 
space not permitting us to pursue the general theory further. 

If the coefficients of the polynomial function F(z), of degree m, are 
in o, and if the coefficient of the highest degree term is not divisible by 
the prime ideal p then we deduce, by well-known reasoning,} that the 
congruence F(w) = 0 (mod p) cannot have more than m mutually incon- 
gruent roots. This proposition, combined with the preceding theorem, 
leads to a complete theory of binomial congruence modulo p. We de- 
duce, among other things, the existence of primitive roots for the prime 
ideal p, meaning numbers -y whose powers 


1,7, y, oa yN)-2 


are mutually incongruent. In general, the theory of congruences of higher 
degree with rational coefficients carries over completely to functions F(z) 
whose coefficients are numbers in the domain o. 

However, we have previously ascertained a close connection between 
the theory of ideals and the theory of higher degree congruences, re- 
stricted to the case of rational coefficients, developed in the works of 
Gauss, Galois, Schonemann and Serret.{ Since all ideals are composed 
of prime ideals, and each prime ideal p divides a determinate rational 
prime p, we obtain a complete overview of all the ideals in the domain 
o by decomposing all ideals of the form op into their prime factors. The 
theory of congruences provides a procedure capable of doing this in a 
great number of cases. If @ is an integer of the field Q and 


A(1,6,67,...,0°-!) = k?A(Q) 
then, if p is not a divisor of k we decompose op into prime ideals as 
t See Dirichlet’s Voriesungen tiber Zahlentheorie, §26. 


{ See my memoir Abriss einer Theorie der héheren Congruenzen in Bezug auf einen 
reellen Primzahl-Modulus (Crelle’s Journal, 54). 
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follows. If f(t) is the polynomial of degree n in ¢t that vanishes for t = 0 
we can put 


SQ SPO" Po)” ---Pe(t)* (mod p), 


where P(t), Po(t),...,P.(¢) are distinct irreducible polynomials with 
respective degrees f1, fo,..-, fe. Then we certainly have 


op = py' po? ++ pe, 
where p1,~2,...,P- are distinct prime ideals with respective degrees 
fi, fo,.--,fe- We deduce from this the following extremely important 


theorem: 


“The rational prime p divides the fundamental number A(Q) of the 
field Q if and only if p is divisible by the square of a prime ideal.” 


This theorem is still true, although the proof is more difficult, when 
the numbers k corresponding to all possible numbers @ are all divisible 
by p. Such a case is actually encountered,} and this is one of the reasons 
I was determined to build the theory of ideals, not on congruences of 
higher degree, but on entirely new principles which are at the same time 
as simple as possible and better suited to the true nature of the subject. 


§27. Examples borrowed from circle division 


By the general theory of ideals, the fundamentals of which I have devel- 
oped above, the phenomena of divisibility of numbers in each domain 0, 
consisting of the integers of a field 2 of finite degree, have been reduced 
to the same fixed laws that govern the old theory of rational numbers. 
If we reflect on the infinite variety of these fields 2, each of which has 
its own theory of special numbers, it is undoubtedly within the spirit of 
geometry to ascertain those general laws obeyed by the various theories 
without exception. However, this is not only of aesthetic or purely the- 
oretical interest, it is also of practical value. The knowledge that these 
general laws exist greatly assists in the discovery and proof of special 
phenomena in a given field 2. To back up this claim to the full we 
would admittedly need to develop further the general theory of ideals, 
and to combine it in particular with the algebraic principles of Galois. 
Instead, I shall simply try to show, using the example that led Kummer 


t See the Géttingische gelehrte Anzeigen of 20 September 1871, p. 1490. 
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to introduce his ideal numbers in the first place, how the elements of the 
theory explained above lead to the goal with great facility. 

Let m be a positive rational prime number, and let 2 be the field of 
degree n resulting, in the manner described above (§15), from a primitive 
root of the equation 6” = 1, that is, a root of the equation 

f(0) = 0" 1407-7 4..-467+041=0. 


Since the coefficients are rational, we have n < m— 1. Moreover, since 
0,07,...,0™— are all roots of this equation we have 


t™ — 
where ¢ is a variable, and consequently 
m= (1—6)(1— 67)---(1-0"71). 


The m — 1 factors of the right-hand side are integers and associates of 


= = (¢—6)(t— 6)... (¢-"-1), 


each other, because if r is one of the numbers 1,2,...,m—1 then 
1-@ 2 r—1 
——— =14 0467 4---4+86 
1-90 
will be an integer, and if s is positive and chosen so that rs = 1 
(mod m), then 
1-0 rae L=0" = ir 2r (s—1)r 
will also be an integer. Thus if we make the abbreviation 
1-0=4p 
we get 
mae pet 


where ¢ is a unit of the field 2 and hence, by taking norms, 
m” = [N(u)]"™—?. 


However, since m is a prime number, N(j) must be a power of m. If 
we put N(y) = m® it follows that n = e(m — 1), and sincen <m-—1 
by the remark above, we conclude that e = 1 and n = m—1 = ¢(m). 
The preceding equation f(@) = 0 is therefore irreducible, the numbers 
0,07,...,9-1 are conjugate, and these numbers correspond to m — 1 
isomorphisms transforming the normal field (Q into itself. At the same 
time we have 


N(u) =m, om=op™!, 


140 Chapter 4. Elements of the theory of ideals 


The principal ideal op is a prime ideal. In fact, if we have o. = ab, where 
a and 6 are ideals different from 0, it follows that m = N(a)N(b), and 
since m is a prime number we necessarily have, say, N(a) = m, N(b) = 1, 
whence b = 0, contrary to hypothesis. At the same time (§21,3) m is the 
smallest rational number divisible by y; the numbers 0,1,2,...,m—1 
form a complete system of mutually incongruent numbers modulo yp. It 
follows from this that a number of the form 


w=kotkhyt kop? os gat od kiniop 


where ko, ki, ke,...,km—g2 are integers, is not divisible by m, and hence 
also not by y”~1, unless all the numbers ko, ki,..., km—2 are divisible 
by m. Because an w divisible by m must also be divisible by yp, and 
this makes kg divisible by yz, and hence also by y?, and this makes k; 
divisible by yz, and hence also by m. Continuing in this way, we conclude 
that the other numbers k,, ke,...,km—2 are divisible by m. 

With the help of this result it is easy to show that the m— 1 numbers 
1,0,0?,...,0—? form a basis for the domain o of all integers in the field 
Q. Since we have 


t™—1=(t-1)f(t), mo™' = (6-1)f'(), 
it follows, by excluding the uninteresting case m = 2, that 
N[f'(0)| =m", 
and since N(6) = 1 and N(@ — 1) = m it follows from §17 that 


A(1,9,02,...,07-2) = (-1)"2 m™-?, 


Moreover, since up = 1— 0, 0 = 1 — p it is clear that the two modules 
(1,6....,6™-?] and [1, u,..., u""—?] are identical, whence it follows (§4,3 
and §17,(5)) that we also have 


A(1, ws, w?,..., pH?) = (-1) BE 2 
Since the numbers 1, p, 4u?,...,u—? are independent, each number in 
the field 2 can be put in the form 


ko tkipt kop? +e: + km—2p™? _wW 
oe et et 
where k, ko, ki, k2,..-,km-—2g denote rational integers without common 
divisor. For this number to be an integer, that is, for w to be divisible 
by k, it is necessary (§18) that k? divide the discriminant of the basis 
1, py, u?,...,"-?, and therefore k cannot contain prime factors other 
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than the number m. Moreover, since it has been shown that w is not 
divisible by m unless the numbers ko, ki,...,km—2 are divisible by m, 
k must also not be divisible by m. Thus we must have k = +1, and all 
integers of the field are of the form 


w = ky + hip + kop? +++ + km—2p™?, 
whence we have 
o = [1,u,...,477] = [1,0,...,0"77], 
or again, since 1+ 6+67+---+9™-2 497-1 = 0, 
o = [0,07,...,0"-}, A(Q) = (-1)"= m™?. 


Now let p be any prime ideal different from oy. The positive rational 
prime p divisible by p is different from m, and we have 


N(p) =p’, 


where f is the degree of the prime ideal p. Two powers 6", 6° are 
congruent modulo such a prime ideal only if they are equal, that is, 
if r = s (mod m). This is because r # s (mod m) implies 6" — 0° = 
6” (1—0"—*) = ew where € is a unit, in which case 0” cannot be congruent 
to 6° (mod p). Now since we have (§26,3) 


oN) = 9 (mod p), 
it follows that 
p! =1 (mod m). 


Let a be the divisor of ¢(m) = m — 1 to which the number p belongs 
modulo m, that is, let a be the least positive exponent for which 


p* =1 (mod m). 


As we know, f must be divisible by a, and hence f > a. But since all 
integers in the field 2 have the form 


w = F(0) = 210 + 2207 +--+ + 2m_10"7! 
where £1,22,...,%m are rational integers, it follows from well-known 
theorems, true for every prime number p, that 
w? = F(6?), w?" = F(6"") (mod p), 
and hence 
w?” =w (mod p). 
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We conclude first of all that the ideal op is a product of distinct prime 
ideals, because if op = pq there would be a number w divisible by pq but 
not by p, so w? and hence also w?" would be divisible by p2q? = pq, and 
then also by p, contrary to the preceding congruence. Moreover, since p 
is divisible by p, every integer w in the field 2 satisfies the congruence 


w?" =w (mod p), 


which therefore has N(p) = p’ mutually incongruent roots w. And since 
its degree is p* we must have pf < p*, thus implying f < a. But it has 
already been shown that f > a, so f =a. We have therefore arrived at 
the following result, which is the main theorem of Kummer’s theory:t} 


“If p is a prime number different from m and if f is the exponent to 
which p belongs modulo m, so that ¢(m) = ef for some e, then 


op = pip2-- Pe, 
where 1, p2,..-,Pe are distinct prime ideals of degree f.” 


The rest follows easily. The general case where m is an arbitrary 
composite number can be treated similarly. The degree of the normal 
field 2 is always equal to the number ¢(m) of those numbers among 
1,2,3,...,m that are prime to m. The preceding law is proved without 
any change, and the determination of the prime ideals that divide m 
does not present any extra difficulty. 

From the very general researches that I am going to publish shortly, 
the ideals of a normal field 2 immediately allow us to find the ideals 
of an arbitrary subfield { of Q, that is, any field H whose members all 
belong to 2. For example, this enables us to know the ideals of any field 
H resulting from division of the circle and, to give a more precise idea, 
of the scope of these researches, I mention the following case. 

Again let m be a prime number, so ¢(m) = m — 1, and let e be any 
divisor of m—1 = ef. In the theory of rational numbers, the congruence 


kf =1 (mod m) 


has precisely f mutually incongruent roots h, which are closed under 
multiplication and which, in that sense, form a group. If @ is again a 
primitive root of the equation 6” = 1 and if 2 is the corresponding field 
of degree m — 1, then all the numbers F(6) in this field satisfying the 
t Kummer’s researches may be found in Crelle’s Journal, 35, in Liouville’s Journal, 


XVI and in the memoirs of the Berlin Academy for the year 1856. 
} Dedekind calls it a “divisor”, as in his footnote in §15. (Translator’s note.) 
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conditions F(9) = F(6") form a field H of degree e, and the e conjugate 
periods§ m,72,---,Ne are sums of f terms, one of which is 


n= > 6, 


and they form a basis of the domain e of integers in H. By general 
results I shall speak of later (or else immediately, by conclusions like 
those derived above in the case e = m — 1) we now obtain the prime 
ideals belonging to this subfield H of the normal field 2. If we put 


p= ]][a-9%), 


then p is an integer of the field H, m is an associate of p®, and ep is a 
prime ideal. Moreover, if p is a rational prime different from m, and if 
p? belongs to the exponent f’ modulo m, then f’ will necessarily be a 
divisor of e = e’ f’ and the principal ideal ep will be the product of e’ 
distinct prime ideals of degree f’. In the case e = m—1, f = 1, H is 
identical with 2, and we again obtain the result proved above. We now 
examine more closely the case e = 2, f = met 

In this case the f numbers / are the quadratic residues of m. If we 


let k denote a quadratic nonresidue, then the two conjugate periods 


n= De f= D0 
form a basis of the domain ¢ of all integers in the quadratic field H, and 
hence its discriminant will be 


2 
n 1 | 1\2 
A(A) = =(n- ; 
(H) non (7-1) 
because 7 + 7’ = —1. The number m is an associate of the square of 
7 


the number p = J](1 — 6*), and ep is a prime ideal. Moreover, ep is the 
product of two prime ideals of degree 1, or else ep is a prime ideal of 
degree 2, according as 


p ? =+1lor -1 (mod m), 
that is, using the notation of Legendre, according as 


(4) =+1lor —-1. 
m 


But we can directly study all quadratic fields, without recourse to divi- 
sion of the circle, and we have already (§18) determined the discriminant 
D! of such a field H. From D’ we can also derive the prime ideals} of 


§ Disquisitiones Arithmeticae, art. 343. 
t See Dirichlet’s Vorlesungen tiber Zahlentheorie, §168. 
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the field H. If the rational prime p divides D’ then the corresponding 
principal ideal ep will be the square of a prime ideal. If p does not divide 
D’', and if p is odd, then ep will be the product of two prime ideals of 
degree 1, or else a prime ideal of degree 2, according as 


é 
(2) =+lor —1. 
Pp 


Finally, if D’ is odd and hence = 1 (mod 4) then e(2) will be the product 
of two prime ideals of degree 1, or else a prime ideal of degree 2, according 
as 

D'=1or5 (mod 8). 


Comparing these laws, valid for all quadratic fields, with the result for 
the special field H derived using circle division, we see first that D’ must 
be divisible by m, but not by any other prime number, and hence that 
(§18) 

A(H) = D! = (-1)"*'m. 


In this way we derive, from entirely general principles and without cal- 
culation, the result 
m—1 
(n— 1)? = (-1)"Fom, 
previously demonstrated in the theory of circle division by effective for- 
mation of the square of 7 — 7/.{ Pursuing this comparison further, we 
are led again to the theorem 


@)- (2) 


where +m = 1 (mod 4), and to the theorem 


This proof of the quadratic reciprocity law, in which we also determine 
the quadratic character of the number —1, is essentially the same as the 
celebrated sixth proof of Gauss, ¢ later reproduced in different forms by 
Jacobi, Eisenstein and others. I should say that it was by meditating 
on the essence of that proof and the analogous proofs of cubic and bi- 
quadratic reciprocity that I was led to the general researches mentioned 
above and soon to be published. 


| Disquisitiones Arithmeticae, art. 356. 
t Theorematis fundamentalis in doctrina de residuis quadraticis demonstrationes et 
ampliationes nove, 1817. 
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As a final example, we consider the case m = 4. We then have 6 =i = 
/—1 and the integers of the quadratic field 2 are the complex integers, 
first introduced by Gauss, of the form 


w=x2+ yt, 


where 2x, y are rational integers (§6). The discriminant of the field is 


2 
EF a eee 


1 -i 


The number 2 = i(i — 1)? is an associate of the square of the prime 
number 1 — 1. If p is a positive odd rational prime then we have 


®P = (-1)°Fi, 
and consequently 
we = (e+ yt)? Sat (-1)"? yi (mod p). 
Now if p = 1 (mod 4), each integer w will satisfy the congruence 
w? =w (mod p), 


whence it follows immediately that op is the product of two different 
prime ideals of degree 1. But if p = 3 (mod 4) we have 


wsu', w sw (mod p), 


where w’ is the number conjugate to w, and we conclude easily that op 
is a prime ideal of degree 2. But every ideal a of this field must be a 
principal ideal. In fact, if ag is a member of the ideal a with minimum 
norm, then each number a in the ideal will be divisible by ap. This is 
because (§6) we can choose the integer w so that 


N(a — wag) < N(ao), 


and since the numbers a, ag, and hence also a—wap, belong to the ideal 
a, we must have N(a@ — wao) = 0, whence a = wag and consequently 
a= 0a. Q.E.D. 


Now, since op is the product of two prime ideals of degree 1 in the 
case where p is a rational prime = 1 (mod 4), it follows that 


p = N(ao) = N(a+ bi) = a? +B, 


which is the celebrated theorem of Fermat. 
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§28. Classes of ideals 


We now return to consideration of an arbitrary field 2 of degree n, 
in order to establish the distribution of its ideals into classes. This 
distribution depends first of all on the theorem (§25,5) that each ideal 
a can be converted into a principal ideal by multiplication by an ideal 
m, and on the following definition: two ideals a, a’ are called equivalent 
when they can be converted into principal ideals am = op, a’m = op! by 
multiplication by the same ideal m. Then we evidently have p’a = pa’ 
and, conversely, if there are two nonzero numbers 7, 7/ such that 7’a = 
na’ then the ideals a, a’ are certainly equivalent. Because, multiplying a 
by m to obtain a principal ideal am = oy, it follows that opin! = n/am = 
na’m. Then jen’ is divisible by 7, whence pun! = u'n, ofe’n = na’m, hence 
a’m = op’. Q.E.D. 

If two ideals a’, a” are equivalent to a third, a, then a’ and a” are equiv- 
alent to each other. This is because the hypothesis yields four numbers 
B, 2,9, 7” such that p’a = pa’, na = na”, and hence (7’z)a’ = (/n)a”. 
Q.E.D. 

It follows that the ideals can be partitioned into classes. If a is a given 
ideal, the system A of all ideals a, a’, a”, .. . equivalent to a will be called 
a class of ideals, and a will be called a representative of the class A. Any 
two ideals in A are equivalent, and any ideal a’ in A can be chosen as a 
representative in place of a. 

It is clear that the system of all principal ideals forms a class by itself, 
since each of them is converted to itself when multiplied by the ideal 
o, and hence they are all equivalent. And if an ideal a is equivalent 
to a principal ideal, and hence equivalent to o, then a itself must be a 
principal ideal, since there are two numbers yp, pz’ such that pa = ou and 
this implies that y is divisible by w’, whence p = p/p” and consequently 
a = ou”. Thus the class represented by o includes all principal ideals 
and no others. We call this class the principal class and denote it by O. 

Now if a runs through all the ideals in a class A, and 6 runs through 
all the ideals in a class B, then all the products ab belong to the same 
class K. Because if a’, a” are in A and 6’, b” are in B there are four 
numbers a’, a”, 3’, 8” such that a’a’ = aa", Bb’ = f'b", and it follows 
that (a””)(a'b’) = (a'B’)(a"b”), that is, a’b’ and ab” are equivalent 
ideals. We denote the class K to which all the products ab belong by 
AB and call it the product of A and B, or the class composed from A 
and B. We evidently have AB = BA, and it follows from the equation 
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(ab)c = a(be) that (AB)C = A(BC) for any three classes A, B, C. Then 
we can apply the same reasoning as for the multiplication of numbers 
or ideals, and show that, in the composition of any number of classes 
Aj, Ag,...,Am, the order in which pairs of classes are combined has no 
influence on the final result, which we can denote simply by A; A2--- An. 
If the ideals a1, a2,...,@m represent the classes A,, Ag,..., Am then the 
ideal aja---adm represents the class A,;Ao---Am. If all the m factors 
equal A, then their product is called the m*® power of A, and we denote 
it by A™. In addition, we put A! = A and A° = O. The following two 
cases are particularly important: 

The equation oa = a yields the theorem that OA = A for any class A. 

Moreover, since each ideal a can be transformed into a principal ideal 
am by multiplication by an ideal m, for each class A there is a class M 
satisfying the condition AM = O, and only one, because if the class N 
is such that AN = O it follows that 


N =NO=N(AM) = M(AN) = MO=M. 


The class M is called the class opposite or inverse to A, and we denote 
it by A~!. Conversely, it is clear that A will be the class inverse to A~?. 
If in addition we define A~™ to be the class inverse to A™ then we have 
the following theorems for any rational integer exponents r, s: 


A’ A® _ Arte. (AT)® = A’, (AB)” = A’B’. 


Finally, it is evident that from AB = AC’ we can always deduce B = C, 
multiplying by A-}. 


§29. The number of classes of ideals 


If we take any n integers w1,w2,...,W,, forming a basis for the field Q, 
then each number 


w= hy + howg +---+hnwn 


with rational integer coordinates h,h2,...,n will be an integer in the 
same field. If we allow the coordinates to take all integer values of 
absolute value not greater than a particular positive value k, then it is 
evident that the absolute values of the corresponding numbers w, which 
are real, or their analytic moduli, which are imaginary, are all < rk 
where r is the sum of the absolute values or moduli of w1,w2,...,Wn, 
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and hence a constant independent of k. Moreover, since the norm N(w) 
is a product of n conjugate numbers w of the form above, we also have 


+N(w) < sk”, 
where s is likewise a constant depending only on the basis. We deduce 
from this the following theorem: 


In each class M of ideals there is at least one ideal m whose norm is 
bounded by a constant. 


Proof. Take any ideal a in the inverse class M~!, and let k be the 
positive rational integer determined by the conditions 


k™ < N(a) <(k+1)". 


If we now allow each of the n coordinates hy, hz,..., hy to take all k+1 
values 0,1,2,...,, then we obtain distinct numbers w and, since their 
number is (k+1)” and hence > N(a), there are necessarily two different 
numbers w, 


B= dy t+ +0pWn, Y= Cw +--+ + Cnn, 
which are congruent modulo a. Hence their difference 
a = (by — c1)wy + +++ + (bn — Cn )Wn 


will be a nonzero number divisible by a. But since the coordinates b, c of 


the numbers , -y come from the sequence 0,1,2,...,4, the coordinates 
b —c of a@ all have absolute value not greater than k, and hence 
+N(a) < sk”. 


Since a is divisible by a we have oa = am, where m is an ideal in the 
class M, and hence 


+N(a) = N(a)N(m) < sk”. 
Moreover, since k” < N(a), it follows that N(m) < s. Q.E.D. 


If we now consider that the norm m of an ideal m is always divisible 
by m (§20), it is clear that there cannot be more than a finite number 
of ideals m with given norm m, because each ideal, and in particular 
om, is divisible by only a finite number of ideals (§25,8). Since there are 
also only a finite number of rational integers m not exceeding a given 
constant s, there cannot be more than a finite number of ideals m with 
N(m) < s, which evidently yields the fundamental theorem: 
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The exact determination of the number of classes of ideals is incon- 
testably a very important problem, but also one of the most difficult 
in the theory of numbers. For quadratic fields, whose theory essentially 
coincides with that of binary quadratic forms, we know that the problem 
was first solved by Dirichlet.t His solution, expressed in the terminology 
of the theory of ideals, rests on the study of the function 

1 1 
2B N(a)s II 1— Ney , 

for infinitely small positive values of the independent variable s — 1. 
The sum is taken over all ideals a, the product over all prime ideals p, 
and the identity of the two expressions is an immediate consequence of 
the laws of divisibility. With the aid of these principles, the number of 
classes of forms or ideals has been later determined by Eisenstein{ for 
a particular case of a field of degree 3, and by Kummert for the higher 
degree fields arising from division of the circle. These researches have 
excited the liveliest interest because of their astonishing connections with 
analysis, algebra and other parts of number theory. For example, the 
problem treated by Kummer is closely related to Dirichlet’s proof of the 
theorem on primes in arithmetic progressions, which can be considerably 
simplified with the aid of these researches. There is no doubt that further 
study of the general problem will lead to important progress in these 
branches of mathematics; however, while part of this research has been 
successfully completed for an arbitrary field 2,§ we are nevertheless far 
from the complete solution, and for the moment we are confined to 
studying new special cases. 


§30. Conclusion 


We shall derive some further interesting consequences of the fundamental 
theorem proved above. (See Disquisitiones Arithmeticae, art. 305-307.) 

Let h be the number of classes of ideals of the field , and let A be a 
particular class. The h + 1 powers 


OAM eA a 


{ Crelle’s Journal, 19, 21. 

t Crelle’s Journal, 28. 

ft Crelle’s Journal, 40, Liouville’s Journal, XVI. 

§ Dirichlet, Vorlesungen tiber Zahlentheorie, §167. 
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cannot all be different, hence there are two different exponents r and 
r+m > in the sequence 0,1,2,...,h such that A™t™ = A”, and 
consequently 

A” =O. 


Moreover, if m is the smallest positive exponent satisfying this condition 
it is easy to see that the m classes 


OA, Ar sg Am 


are all different, and we say that the class A belongs to the exponent m. 
Obviously A”-! = A-!, and more generally we have A” = A® if and 
only if r = s (mod m). Then, if B denotes any class, the m classes 


(B) B,BA,BA?,...,BA™-! 


will be all different, and any two complexes of m classes, such as the 
preceding (B) and the following 


(C) C,CA,CA*,...,CA™—}, 


will either be identical or have nothing in common. In fact, if both 
include the same class BA” = C'A® then we have C = BA"~’, whence 
it follows immediately that the m classes in (C’) are the same as those 
in (B). Thus the whole system of h classes is partitioned into g such 
complexes and, since each complex includes m different classes, we have 
h= mg. That is, the exponent m to which the class A belongs is always 
a divisor of the number of classes, h. It follows that we have the theorem 


A* =O 


for each class A. Now if a is any ideal in any class A, then a’ belongs 
to the class A”, and hence to the principal class. That is, the h*® power 
of any ideal is a principal ideal. 

With this important theorem we come to see the notion of ideal from 
a new point of view, at the same time connected with a precise definition 
of ideal numbers. Let a be any ideal and let a* = oa,. Now if a denotes 
any number in the ideal a, a” will be in a”, and hence divisible by the 
number a, and it follows from §13,2 that a is divisible by the integer 
Ht = %/a1, which does not in general belong to the field 2. Conversely, if 
a is an integer belonging to 2 and divisible by yz, then a” will be divisible 
by uw" = ay and consequently (oa)" will be divisible by 0a; = a”, from 
which we easily conclude, by the general laws of divisibility (§25), that 
oa is divisible by a, so that a is a number in the ideal a. Thus the ideal 
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a consists of all the integers in 2 divisible by the integer yx. For this 
reason we say that the number p, though not actually a member of 2, 
is an ideal number of the field Q, and that it corresponds to the ideal a. 
Or, a little more generally, an algebraic integer py is said to be an ideal 
number of the field 2 when there is a power yu", with positive integer 
exponent r, equal to an actual 7 in 2, and at the same time an ideal a 
in the field (2 satisfies the condition a” = oy. The latter ideal a is the 
ideal corresponding to the ideal number yp, and it is a principal ideal if 
and only if yz is an associate of an actual number in the field 2. (See the 
Introduction and §10.) 

We end our considerations with the proof of the following theorem 
announced earlier (§14): 

Any two algebraic integers a, G have a common divisor 6 which can 


be expressed in the form 6 = aa’ + 36', where a’ and f3' are likewise 
algebraic integers. 


Proof. We assume that the two numbers a, 3 are nonzero, otherwise 
the theorem is evident. Then it is easy to see that there is a field 2 of 
finite degree including both a, 3. Let 0 be the domain of integers of this 
field, and let h be the number of classes of ideals. Now put 


oa=ad, of =bd, 2” = 06y, 


where 0 is the greatest common divisor of oa, 08, and 6, is in o. Since 
a*, 8 are divisible by 0°, we can put 

a =a16, B°=fi51, 0a, =a", of, =b", 
where @1, {; are likewise in 0. Also, since a and 6 are relatively prime 
ideals, o will be the greatest common divisor of 0a, of and, since the 
number 1 is in 0, there will be two numbers az, {2 in o satisfying the 
condition 


2102+ BiB2=1, or aaa + B"By = bj. 
If we now put 
6, = 6", 


then the integer 6 will be a common divisor of a and , since a”, 8" are 
divisible by 6,, and hence, since h > 1, we can put 


aga’? = a’? pope-* = pe, 


where a’, 3’ are integers satisfying the condition aa’ + 66’ = 6. Q.E.D. 
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If at least one of the two numbers a, @ is nonzero, then the number 
6, and any of its associates, deserves the name greatest common divisor 
of a, 8. If 6 is a unit then a, @ may be called relatively prime, and 
two such numbers enjoy the characteristic property that any number pu 
divisible by a and £ is also divisible by af. This is because the equations 
= aa” = BB" and 1 = aad’ + Gf’ imply 


y= aB(a'" + Bia"), 


and the converse is equally valid, since a, G are both nonzero. 
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